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One of the perennial problems of 

educational statistics is the interpre- 
tation of the coefficient of correlation 
between two variables. This is, to be 
sure, less of a problem for the limited 
few who (through good fortune or other- 
wise) have had extensive experience with 
all degrees of correlation, both high 
and low. And yet, even in this excep- 
tional group, one occasionally finds a 
misunderstanding or misstatement which 
can only be considered amazing. Thus, @ 
reputable statistician of the last dec- 
ade has written that a correlationof .75 
between two variables means a closeness 
of correspondence equal to "about 75 per 
cent of perfect interdependence" (6, 
p- 32). On no reasonable grounds is this 
interpretation of a correlation of .75 
generally defensible (7, pp. 264-266; 12, 
14). 

Misinterpretation of the corre- 
lation coefficient is more likely to oc- 
cur when the complicating factor of er- 
rors-of-measurement enters in. Suppose, 
for example, that it is desired to pre- 
dict a young person's aptitude for, let 
us say, medical research. In attacking 
this problem, a quantitative criterion 





of success in medical research would 
typically be set up; tests of ability 
(and possibly personality) would be de- 
vised to predict the criterion; and the 
multiple correlation between the tests 
and the criterion would be obtained. 
Even if this multiple correlation were 
as high as .75, the "index of forecast- 
ing efficiency"* (7, pp. 268-271) would 
be something less than 35 per cent. This, 
of course, represents an unsatisfactory 
state of affairs, especially since the 
multiple correlation between a test bat- 
tery and a criterion is commonly below 
75. 

At this point it is, however, in- 
portant to remember that the correlation 
between a test and a criterion is ordi- 
narily the correlation between the test 
and a decidedly fallible criterion. The 
low correlation between a test and a 
criterion may obviously arise not merely 
through defects in the test, but also 
through defects in the criterion. It is 
desirable, therefore, to know not only 
the index of forecasting efficiency for 
the raw correlation between the test and 
the criterion (i.e., Egy, where the let- 
ter E stands for "efficiency", the 





l. The writers are indebted to Professor Harold E. Jones, Professor Noel Keys, Dr. Harold D. Carter, 
and Miss Ruth H, Krause for reading and criticism of the manuscript. For assistance in the com- 
putation of Table 1, we are indebted to Mrs. Lina Hutson Aylesworth. 

2. The "index of forecasting efficiency" is 1 - \/1 - r#,. The radical \/1 - r2, (termed the "co- 
efficient of alienation") is the generally accepted measure of the degree to which test x has 


failed to predict the criterion, c; the quantity l - 


V1 - r2., or E,,, states the degree to 


which test x has succeeded in predicting the criterion, c. 
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subscript "c*® stands for the criterion, 
and "x" for the test); it is desirable 
to know also the index of forecasting ef- 
ficiency for the correlation between the 
test and the "true" criterion--i.e., the 
criterion freed from random errors of 
We want, in other words,not 
(where the subscript "c," 
stands for a "true" score in the criteri- 
yn; and "x", as before, stands for the 
fallible text).5 In general, the effi- 
ciency of a test in predicting a true 
criterion (Ec. ~*® is greater than its ef- 
ficiency in predicting the fallible. cri- 
terion (Ecy); for in the former case, 
random errors of measurement in the cri- 
terion are balanced out, and this source 
of discrepancy between test-scores and 
criterion-scores is thus eliminated. The 
index, Ec x, is always greater than Ec,y, 
unless (the reliability coefficient 
of the cot tarion) be 1.00; but this of 
course never happens in actual practice. 
The value of the index of fore- 
casting efficiency in the case of a true 
criterion (Eq,x) may be quite simply ob- 
tained, as follows. By definition, 


measurement. 
Eex,» but Ec 


0) . 
© 4 Ca 
0 
Cy 


2 
or l - | fata * 


E 
cx 


The value of r& , may most conveniently 
be calculated from the formula, 


r2 
c= 


is the Spearman-Brown reli- 


wnere Teic, 





ability coefficient of c, obtained by 
correlating "split halves" and then ap- 
plying the Spearman-Brown formula. This 
formula is given (in somewhat different 
notation) by Kelley (8, p. 201). Sub- 
stituting this value of rR in the defi- 
nition of Eo x above, we ffnd-- 


When, by supposition, the criterion is 
perfectly reliable (1.e., rg we * 1.00), 
formula (1) reduces, as it should, to 


punemanmae 
1-y1- ra, or E,y, the conventional 
"index of forecasting efficiency." The 
use of formula (1) is, of course, re- 
stricted to the cases where the Spearman- 
Brown technique is considered applicable, 
(Alternative methods for the evaluation 
ofr and the determination of Eq x, 
are presented in the Supplementary Note 
at the end of the present paper.) 
It is of some interest to compare the 
function in formula (1), 
2 
Tox 
i « 
TeiCs 
(or as we may symbolize it, k, x), with the more 
familiar "coefficient of alienation", 1 - ra, 
(or kgy)- The two functions are identical, ex- 
cept that in the former, rg, is divided by the 
decimal quantity, To,c,- It is well known that 
the value of 


1- rt, 





5. It is not difficult to imagine cases where the unreliable criterion, despite its unreliability, 


should be considered final and absolute. 
rather than scholastic aptitude for college. 
is the sole interest, Ec, and not Ec 


is the proper measure of predictive efficiency. 


Thus, one may wish to predict actual college grades, 
In such a case, if the prediction of actual grades 


To the 


writers, however, it appears indefensible to limit one's interest and effort to this one prac- 


tical problem. 


at hand (as would be shown by a wide discrepancy between 
to eliminate the unreliability and attendant unfairness of 


If, for example, the college grades were seriously unreliable for the purpose 


and Ecx), steps should be taken 
e grades. 


Besides random errors of measurement in the criterion, reflected in the reliability co- 


efficient, there are, of course, various systematic errors. 
also impair the validity of a criterion. 


fect individual scores unequally, 


rection for such systematic errors is attempted in the present paper. 


These, to the extent that they af- 
No statistical cor- 
A valid correction for 


systematic errors would probably require more information concerning the nature and intensity 


of these errors than is generally available. 
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drops very slowly for changes in r,, from, say, 
.00 to .30, and very rapidly for changes in roy 
from, say, -95 to 1.00 (2, p. 299). The func- 
; | rox 
tion, \/ - ’ 
V Teics 
in even more pronounced form, since the effect of 
division by the decimal quantity, ro,c,, is to 
increase the higher values of r., more than the 
lower;* and this accentuates the already pro- 
nouncedly differential rate of decline of ko, 
for increasing values of ro,;. 
Table 1 gives values of the in- 
dex of forecasting efficiency (with cor- 


shows the same behavior, but 


rection for random errors in the criteri- | 


on), for various values of roy (correla- 

tion between test and criterion) and 
(reliability of criterion).5 The 

rinst column of Table 1 (in which ro.o, 

is taken as 1.00) gives values of the in- 

dex of forecasting efficiency when (by 





supposition) random errors do not occur 
in the criterion-measurement at all. n 
glance along the rows of the table, from 
this first column, serves to indicate 
the influence of unreliability of the 
criterion upon values of the corrected 
index of forecasting efficiency, for 
given values of rex. A glance down the 
columns of the table serves to indicate 
the effect of changes in rey upon values 
of the corrected index of forecasting 
efficiency, for a fixed value of ro.c,, 
It may be noticed, in the table, that no 
data are included for values of roe.c, 
below .30; it was felt that tests with a 
reliability below .30 are of ee OY 
practical or theoretical interest.’ Be- 
tween .30 and .60, Toice is tabled in 


intervals of .05; between .60 and .980, 
in intervals of .01; between .980 and 
in intervals of .005. 
tg 


1.000, The values 





4, Thus, when rex is .50 and ro.,, is .80, ro, = 


2 
Tox’ 


before) .80, then rs. = .56 and 


- 56 


—* -80 


. Table 1 was independently computed by both authors. 


This may be considered a i ee modest rise. 
or .70--a rise of 


25 and = 
r 


25 
-80 


r 
= or .3l--a rise of .06 above 
C1Cg 


But when roy = .75 and ro,,, is (as 


14 above r2,. 


The first method of calculation made use of 


formula (1), the value of the expression within the radical, 


\/ 


, ** 


2 
Tox 
Te1ce 


being computed correct to six decimals, and the square root obtained correct to three decimals 


with the aid of Barlow's Tables (1). 


SS S= 
E.z7z1l-yi1-ri 
on 


Cox? Tox 





The second method of computation employed the formula, 
being computed by the formula 


Tox 


r 2, 
“ot y Teica 


and V 1 - re 
use of Miner's Tables (11). 


being then obtained (without interpolation for the fifth decimal in Tox) by the 
These two methods of computation could not be expected to yield ex- 


actly the same answer in every instance, since the absence of interpolation in the second method 
occasionally introduces a slight error. All such discrepancies by the two methods were, of 
course, investigated and resolved. The tables, as published, should be correct to the number of 
decimals given. 

The blank spaces in Table 1 require a word of explanation. The correlation, r,, cannot (except 
by chance) exceed the correlation between actual scores in the criterion and true scores in the 
This maximum correlation (symbolized as Toc.) is termed the "index of reliability"; 


numerically, it is equal to \// Tose (5, pp. 272-273). In Table 1, the first blank space which 


occurs refers to the combination, Tc, cg = -995, Toy = 1.000). Now, when Toic, ~ 995, the 
highest possible value of r,, is \/.995 or .997; the combination in the table, therefore (in 


which rox = 1.000 and ro.¢, = -995) is impossible, or imaginary. Hence, the value of Eo x has 
not been computed for this combination. Indeed, if one attempted to compute Eo x for this com- 


bination, one would obtain the value 
a r 
1 - 95 = V2 


A similar explanation applies to all the blank spaces in the 


criterion. 





- 1.005 = \/-.005, 
which is an imaginary number. 
table. 

7. Professor T. L. Kelley, for example, has recently suggested (in connection with the reliability 
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of rey are also tabled in terms of a4 
somewhat similarly varying interval, ex- 
cept that values of rey are included 
down to .00. 

Table 1 serves to emphasize cer- 
tain facts which, at times, seem insuf- 
ficiently appreciated. It has for some 
time been generally realized thataslight 
change of r in the region of, say, rex 
-95, is quite significant--much more 
Significant than a numerically equal 
change in the region of, say, fcx=.30. 
But it does not seem to have been equal- 
ly well realized that, when rc,c, 1s 
(shall we say) .60, a change in roy from 
-76 to .77 (if based on sufficient cases 
to be reliable) would signify a greater 
improvement in the corrected index of 





forecasting efficiency, than a change in 
rex from .96 to .99 when re,c, = 1-00. 
(As shown in Table 1, the change in pre- 
dictive efficiency in the former case is 
from .807 to .891, in the latter case, 
from .801 to .859.) In short, the re- 
gion of rey where a slight change in nu- 
merical values becomes significant, is 
determined not alone by rex, but also-- 
and to an important degree--by ro,c,, the 
reliability of the criterion. This ef- 
fect is particularly noticeable in con- 
nection with the higher values of roy, 
and also in connection with the very low 








values of re,c,- Iilustrations: When 
Pe,c, 18 +55, the change in rey from .58 
to .59 Signifies more improvemement in 


the corrected index of forecasting effi- 
ciency, than the change in rey from .985 
to .995 when r¢,,, = 1.00. (The change 
in efficiency in the former case is from 
-803 to .926, in the latter case is from 
-827 to .900.) When re.c¢, is .61, the 
chang? in rey from .89 to -90 signifies 
slightly more improvement in the correct- 
ed index of forecasting efficiency, than 
the change in r,, from .990 to 1.000 when 
Te,c, = 1-00. (The change in efficiency 
in the former case is from .851 to 1.000, 
in the latter is from .859 to 1.000.) 
Another fact in Table 1, worthy 
of separate and explicit mention, is the 








varying effect of a given difference in 
'c,c, upon different levels of rex. If 
one examines along the rows of Table 1 
in the region of the higher values of 
Pox (say around .60), one may be im- 
pressed by the definite increase in the 
value of Ec y as the value of ro,cg, de- 
creases (rey itself remaining constant). 
Evidently, then, when rey is high, low 
reliability of the criterion is a sig- 
nificantly limiting factor upon the prac- 
tical efficiency of test-prediction. But 
in the region of lower values of rex 
(say around .30), a glance along the rows 
of Table 1 shows that, while the values 
of Eg x do rise somewhat as rce,c, de- 
creases, the rise is quite small. For 
low values of rex, then, changes in the 
reliability of the criterion influence 
the predictive efficiency of a test _ so 
slightly, as to be practically negligi- 
ble. This is not to say that, with a 
low value of rey, improvement in the 
validity or nature of the criterion will 
fail to be of any service; it is merely 
to emphasize that--unless the reliabil- 
ity of the criterion is extraordinarily 
low--very poor predictive efficiency in 
a test is not attributable to random er- 
rors of measurement in the criterion. 
The first column of Table 1 gives values 
of E when roic, Of formula (1) equals 1.000; 
the values in this column, as previously stated, 
are identical with the values of Ecy. The other 
values of E in Table 1, to the extent that 
they differ from Ecy and are based on fallible 


values of roy and Te,co ares of course, merely 
estimates; these values of E state the fore- 


casting efficiency that would occur, if rex and 
Te,cg were exactly as found in the given sample, 
and if the reliability of the criterion were 
lifted from its sample-value to unity. The 
larger the difference between E and E,;, the 
greater is the departure from actual empirical 
finding, and the greater, correspondingly, should 
probably be the caution of interpretation. A 
large difference between E and Ecy will, in 
practice, generally arise from a low value of 
the reliability coefficient, rc,c.- The conse- 
quence of a low reliability coefficient is, in 




















(Footnote continued) of a certain test in the ninth grade) that, "to yield a serviceable group 
test, the reliability should be .40 or better" (10, p. 300). 


Previously, Professor Kelley had 


Suggested .50 as the lower limit of acceptability for the reliability of a test in a single 


school grade (9, p. 211). 
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the first place, an increase in the sampling er- 


ror of E, x; this may, in a sense, be experimen- 
tally compensated for, simply by including suf- 
ficiently more cases in the sample. A second 

difficulty arising from a low reliability coef- 


ficient is the increased difficulty of interpre- | 
| recourse is to estimate the value of 


tation of Ec x. The lower the reliability co- 
efficient of the criterion, other things being 
equal, the greater will be the uncertainty as to 
whether or not the criterion suffers not only 
from random errors, but also from various un- 
known systematic errors. Since the effect of 
such systematic errors may be either to raise or 
lower rox from its proper value, the tabled val- 
ues of E in such a case may be either too 
high or too low. The point we wish to make is 
that the availability of a correction for the 
unreliability of the criterion should certainly 
not be taken as a quite satisfactory substitute 
for actual high reliability; and given high re- 
liability, Eo x will be close to Ecx. 

Figure 1, on the following page, 
presents, in graphical form, the valueof 
Eco,x corresponding to roy, when re.c, 
equals, respectively, .40, .50, .60, .70, 
.80, .90, and 1.00.8 The heavy 
Figure 1 (giving values of Ecx When 
Te,c, = 1-00) is identical with the curve 
of the uncorrected index of forecasting 
efficiency, Ecy. Comparison between this 
heavy curve and the others will serve to 
emphasize two facts already indicated 
above: (1) The lower the reliability of 
the criterion (re,c,), the greater the 
difference between the corrected and the 
uncorrected index, for a given value of 
Tex; and (2) the higher the value of rex, 
the greater the effect of a small change 
in rex upon the value of Ec x, for a 
given value of rc,c,- Both these trends 
are more conspicuous for the higher val- 
ues of rox, and negligible for very low 
values of rox. 

The use of Table 1 presupposes 
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precise values of rey (the correlation 
between the test and the criterion), and 
also of re,c, (the reliability coeffi- 
Too often, the 
actly known. In such a case, the only 
Te,cg» and to use the resulting value of 


Ec x with due discretion. Certainly it 
than to make the 


of forecasting efficiency as if the re- 


| liability of the criterion were really 


1.00. 
In notices and advertisements of 


| educational tests, one frequently runs 


across such a statement as, "The test 
marks about as 
well as the marks correlate with them- 
The implication seems to be 
that such a test is about as good as any 
for (the 
question is implied) how could a test be 
expected to correlate higher with grades, 
than the grades correlate with them- 
selves? This implication we consider 
Certainly we may legitimate- 
ly ask that a reliable test, designed to 
Supply an adequate measure of achieve- 
ment in a given subject, should corre- 
late significantly higher with grades in 
the subject, than two unreliable grades 
correlate with each other. As a matter 
of fact, reference to Table 1 shows that, 
when cox equals, say, .65, and ro.g, 
equals .70, the index of forecasting ef- 


| ficiency (even after correction for ran- 


dom errors of measurement in the criteri- 
on) is only .370 or 37.0 per cent.9 This 
can hardly be considered a degree of ef- 
ficiency worth boasting of, certainly 

not for individual guidance. With the 
reliability of grades equal to .70, the 
correlation between the test and grades 





8. In practice, a value of 1.00 for rce,c, would probably never occur, any more than would the value 


ex = 1.00, or E = 1.00. 
their interest as theoretical upper limits. 


All these values are, however, included in Figure 1, because of 


To the extent that the criterion contains systematic errors affecting r,, (cf. footnote no. l, 


page 232), this figure is too low. 


Even so, however, the difference between 37.0 per cent and 


perfection is far too great to be assigned wholly to systematic inadequacies of the criterion. 
Conservative usage suggests rather that the discrepancies between test- and criterion-scores, 
persisting after correction for unreliability of the criterion, should in general be assigned 


mainly to inadequacies of the test. 
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CORRELATION BETWEEN TEST AND CRITERION (x) 


Figure 1. Plot of the corrected index of forecasting efficiency (Ex x) for 
stated values of the reliability coefficient of the criterion (ro,¢,). 


would have to equal .72, for the cor- 
rected index of forecasting efficiency 
(Ec x) to equal 50 per cent; it would 
have to equal .82 for the corrected in- 
dex of forecasting efficiency to equal 
80 per cent. Looking at Table 1, we can 





find but little comfort in the supposed- 
ly reassuring statement that "the test 
correlates about as well with grades as 
the grades correlate with each other"-- 
unless, to be sure, the self-correlation 
of the grades themselves is uncommonly 


high. 
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On the other side of the picture, 
it is clear that the thoughtless or rou- 
tine use of the uncorrected index, Eo,, 
instead of the corrected index, Eo x, 
tends to underestimate the true worth of 
tests. For example, when rey equals .70 
and To,c, equals .75, then the uncor- 
rected index, Ecy, equals only .286, or 
28.6 per cent; corrected for random er- 
rors of measurement in the criterion, the 
index (Ec._x) becomes 41.1 per cent--a 
definite improvement over the uncorrect- 
ed index. Additional illustrations of a 
similar sort can easily be found in Ta- 
ble l. 


SUMMARY 


A low correlation between a test 
and a criterion may be due to inadequa- 
cies of the test, inadequacies of the 
criterion, or both. It is frequently de- 
sirable to know the index of forecasting 
efficiency of a test, after correction 
for random errors of measurement in the 
criterion. A formula is given by which 
such a correction may be effected; in ad- 
dition, a table is presented of values of 
Ec x (the corrected index of forecasting 
efficiency), for various values of Tox 
(correlation between test and criterion) 
and of re_¢, (reliability of criterion). 
The table serves to emphasize the follow 
ing facts: 

1. The region of rex where & 
slight change in correlation becomes sig- 
nificant is determined not alone by rex, 
but also, to an important degree, by 
Te,c,* Tous, with roic, equal to .60, a 
change in rox from 76 to .77 would sig- 
nify a greater improvement in the cor- 
rected index of forecasting efficiency 
(Eo.x), than a change in rc from .98 to 
-99 when Toic, = 1.00. 

2. When roy is high (around .75 
or .80, low reliability of the criterion 
is a significantly limiting factor upon 
the efficiency of prediction by a test. 
But when roy is low (around .30), changes 
in the reliability of the criterion only 
negligibly affect the predictive effi- 
ciency of a test. The unreliability of 





a criterion would have to be extraordi- 
narily low before one could legitimately 
attribute the very low predictive effi- 
ciency of a test to random errors of 
measurement (i.e., unreliability) in the 
criterion. 

3. An educational test which 
®*correlates with grades about as well as 
grades correlate with each other", may 
still not possess a satisfactory degree 
of predictive efficiency. Thus, if 
Te,c, (reliability or self-correlation 
of grades) equals .70, and re, (correla- 
tion between test and grades) equals .65, 
the index of forecasting efficiency-- 
even after correction for random errors 
of measurement in the criterion--is only 
37.0 per cent. Consideration of system- 
atic (as distinguished from random) er- 
rors in the criterion, affecting rox, 
may justify raising somewhat this figure 
of 37.0 per cent--but hardly enough to 
reach a degree of predictive efficiency 
that could reasonably be called satis- 
factory. 

4. The efficiency of tests con- 
structed for prediction-purposes, while 
still undoubtedly lower than desirable, 
is nevertheless greater than would be in- 
ferred by use of the more commonly quot- 
ed (but not generally more valid) formu- 


la, 
Bex = 1- \f 


Figure 1 illustrates this fact, and Ta- 
ble 1 gives it a detailed, quantitative 
expression. 


1 _ 2 . 
vox 
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i SUPPLEMENTARY NOTE 


The special assumption of formula (1) is 
that two "comparable" measures of the criterion 
are available--the term "comparable" being de- 
fined to include (a) equal correlation with the 
criterion, (b) equal reliability, and (c) equal 
standard deviations. In psychological and edu- 
cational work, the two comparable measures will 
ordinarily consist either of scores from "split 
halves", scores from alternate forms, or scores 
from re-measurement. A more general formula 
than (1), not requiring the assumption of com- 
parability (but still requiring two measures of 
the criterion), may be written as follows (cf. 
reference 2, p. 30): 


Ey x =1 - > - 


In this formula, c, represents one measure of 
the criterion, and cg another; rc,c, is the cor- 
relation between the two measures. The only re- 
striction upon c, and cg in formula (2) is that, 
within the limits cf random errors, both c, and 
Cg should be measures of the same thing; within 
this restriction, c, and cg may represent "split 
halves") of a single test, or measurements by 
re-tests, or measurements by alternate forms, or 
measurements by differing techniques, etc. Form- 
ula (2), like all other formulas in this paper, 
assumes that errors of measurement are uncorre- 
lated with each other, and with the "true" meas- 
ures. 





Te, x™cgx ( 2) 


FeiCg 


Formula (2) is required whenever the two 
measures of the criterion, c, and cg, are not 
statistically "comparable." An illustration of 
this occurs when the criterion consists of rat- 
ings from, say, five judges. In such a case, it 
would be well nigh impossible to find one sub- 
group of judges whose average ratings are statis- 
tically comparable with the average ratings from 
the remaining sub-group. 

The use of formula (2) is optional when 





10, In formula (1), it will be recalled, Teicg Stood not for the correlation between two split 


halves, but for the Spearman-Brown reliability coefficient of the criterion. 


This difference in 





notation between formula (1) and formula (2) should be carefully noted. The meaning of formla 





(1) may obviously be extended to include any case where Tec, 18 the correlation between any two 
comparable measures of the criterion, and r,, is the correlation between the test, x, and either 
of the two comparable measures of the criterion. 
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two experimentally independent and tolerably com- 
parable measures of the criterion are available; 
for in this case, since the two measures are rea- 
sonably comparable, the freedom of formula (2) 
from assumptions loses most of its practical sig- 
nificance. Presumably an advantage still re- 
maining in formula (2) is that this formula, 
th in its numerator and in its denominator, 
nakes full use of all the available data; this 
should render the probable error of formula (2) 
smaller than that of formula (1). But if two ex-| 
perimentally independent measures of the criteri-| 
on are available, the term rex in the numerator | 
of formula (1) may, without essential alteration | 
of the formula, be modified to 
| 
| 





Toyx * Togx 
; 
2 


and if this is done, then forma (1) makes just 
as full use of the entire empirical data as form 
ula (2). The use of the modified formula (1), 
moreover, offers a practical economy, in that the 
numerator, Tco,xTo,x, Of formula (2) calls for the 
computation of two correlations between the cri- 
terion and x; whereas 


2 
(= + rae 
2 


may be calculated by the formula—- 





2 2 
Fez * Fou _ "(ex + ca)x 4] 
2 2 : 
i+ voices 


which involves only a single correlation between 
the criterion-measures and x. With this modifi- 
cation of formula (1), suggested by the avail- 
ability of additional data, formula (1) may be 
rewritten— 





r(¢ + Ce) 
@2 <1 11 «coe (3) 


2Toicg 





+ 
1 Toice 


where Terc, stands, here, for the correlation be- 
tween the two experimentally independent measures 
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formula (3) becomes especially significant, if 
the criterion enters into multiple correlation 
with a series of tests, instead of (as generally 
assumed in the present paper) merely into simple 
correlation with a single test, x. 

If the assumptions of the Spearman-Brown 
formula are admissible, there is no advantage in 
using formula (2) when the two measures of the 
criterion are merely split-halves of a single ex- 
perimental measurement; for in this case, formu- 
las (1) and (2) are equivalent. The proof is as 
follows. By the Spearman-Brown assumptions, 
To.x of formula (2) = Teox = Tox} hence formla 
(2) may be written— 


2 
Tox 


Egx=1- ’ fae (4) 








where rcic, is the correlation between the two 
split halves. Now, writing formula (1) in nota- 
tion conforming to the notation in formula (2) 
(cf. footnotes 1 on page 242, and 2 below), 





2 
(cy + Cg)x 


Rex 21 -\/1- = 
= T(cy + Cg)(cy + Cg) 


\ 


but 
Xe,x + Ucgx 








2 

¥ = 
(cy + cg)x ; . 
Noy Vo%, i.” aro cg 0,%cg 


22cx ° 
¢ 
No, \| 20% + 2Po.cg%e 











“ ( ONT ox 00x - rox 
NOLO, \2 + ar, a. 1+ Toc, 
Also, by the Spearman-Brown formula, 
Te. ce 


C, + Cg) (cy + cg) * 
1+ To.ce 


" 


Hence 
2 


Tic, + Cg)x 
Fe, + cg) (c, + cg) 


2 
Tox 


2 
2rox : 1 + Torcg 





1+ TeiCs ®Teice 





of the criterion.1® the practical economy of 


Teice 








ll. The derivation of this formula may be found (in a somewhat different notation) in reference 4. 
12. In formula (1), Teics stood for a Spearman-Brown reliability coefficient--cf. footnote 1 on 


page 242. 
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Whence, substituting in (5), | 2Fe ice 
| and with 7; instead of r ° 
| ee Toice CiCyg 


eS a 
oe 3 (6) 
| Te1Cg In this discussion, the advantage of 
| formula (1) has perhaps been lost sight of. The 
which agrees exactly with (4) above. | chief adventage of this formula is that it calls 
Table 1 is of service whether formula for only one experimental measure of the criteri- 
(1), (2), or (3) 18 employed. If formula (2) | on, thus reducing significantly the time and 
has been used, enter the table with cost of data-collectiopn. The price of this ad- 
at vantage is a larger number of assumptions, and 
y ToixTcax ? a higher probable error due to the paucity of 
experimental data (12). 
instead of r,,. If formula (5) has been used, 
enter the table with rz(c, + c,) instead of Fox; | 








A TABLE FOR COMPUTING BISERIAL "r*1 


by 


Laverne E. Kolbe and Harold A. Edgerton 
Occupational Research Program 
U. S. Employment Service 


The computation of any large nunm- 
ber of biserial correlations is a time- 
consuming process. In order that the com 
putations may be speeded up, a table has 
been developed. A convenient arrangement 
of the formula was used: 

M; - M p 

eer = 

t 

Mean of the continuous vari- 
able for one category of the 
dichotomized variable. 

= Mean of the continuous vari- 
able. 
Standard deviation of the 
continuous variable. 
Proportion of the total nun- 
ber of observations in the 
group from which M; is com- 
puted. 
Ordinate of the unit normal 
curve at point P. 


The table has been arranged to 
give r correctly to two decimals without 
interpolation. To enter the table one 
needs to know the value of p, and of 


(ly - Mt) por convenience, the latter 
Ot 


quantity will be called A. It has been 
suggested by J. W. Dunlap that A is much 
easier to compute if the observations be 
transmuted so that the values My and 0+ 
be such that finding the difference My, 

-~ My and dividing by 0; can be done men- 
tally. Such values as My = 50 and o; 

= 10 are excellent for the purpose. This 
transmutation scheme is of particular 
value where a large number of biserial 
correlations must be computed using the 


same continuous variable, as in the case 
of validating test items against an out- 
side criterion or against the internal 
consistency criterion of total score. 
The table is arranged as follows: 

Columns are identified by the 
value of p. Rows are identified by val- 
ues of biserial r. The table entry is 
the greatest value of 4 which, for that 
particular value of p, can give the val- 
ue of r shown for that row. In the ta- 
ble, values of A are all given to three 
decimals. The decimals have been omit- 
ted to make the table more compact. Thus, 
a table entry of 1617 is a A of 1.617. 


Directions for using the Table: 








1. Compute the values p and A. 

2. Find column p. 

3. Go down in Column p until a 
table entry is found exactly 
as large as or just larger 
than A, 

The value of biserial r is 
read at the end of that hori- 
zontal row. 

5. Assign the same sign tor as 
is attached to A. 

Example: 

Assuming that Mt is 50 and o¢ is 
10, the value of My is found to be 43.62. 
From this, it is readily seen that A is 
-.638. My is based on 32 cases out of 
164, which is 20 per cent of tne cases. 
Find column p = 20 in the table. Follow 
this down until the value of A, exactly 
as large as .638 or just larger, is found. 
The value .636 is found so the value just 
larger (.650) is used. This is in row 
r= .46. Give r the same sign as A, in 
this case r = -.46. 





1. The table was constructed by the Statistical Unit of the Occupational Research Program to facil- 


itate the computation of biserial correlations. 
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I. Comparable Tests versus 
Experimental Independence 


There are two methods, in gener- 
al, of arranging a testing program for a 
Study in which intercorrelations and re- 
liability coefficients of a number of 


variates are to be obtained. The first 
and most common methods is to give all 
the tests at one sitting, or at least on 


the 


same day. The reliability of each 
test is obtained by correlating halves 
and applying the Spearman-Brown Formula. 
This is true whether we have two forms of 
each test or only one. In the former 
the score on Form A is one half, 
the score on Form B is the other half, and 
total score on the test is the sum of 
the scores on Forms A and B. It is these 
total scores which are used in computing 
the intercorrelations. In the latter 
case, an effort is made to split the 
items into comparable halves, usually by 
grouping the odd items in one and the 
even items in another, or by taking items 
1,4,5,8,9, etc., as one form and items 
£2,3,6,7,10,11, etc., as the other. In 
this case, as in the former, the corre- 
lation between the half-tests is ob- 
tained first, and the reliability of the 
total test is estimated by the Spearman- 
Brown Formula. The total scores are used 
in computing the intercorrelations. This 
first method of procedure implies a defi- 
nition of reliability and a set of as- 
sumptions regarding the tests and half- 
tests. The reliability implied in the 
definition might be designated "instan- 
taneous reliability.* 

Any reliability coefficient may 
considered as the ratio of the vari- 


case, 


the 


vil 


be 


‘ 


ON CERTAIN ESTIMATED CORRELATION FUNCTIONS AND THEIR STANDARD ERRORS 


by 


| 
| 





1 


ances (squared standard deviations) of 
theoretical measures of the true or un- 
derlying ability, to the corresponding 
test scores. In the case of "instan- 
taneous reliability", the true ability 
means the ability of the subject, inde- 
pendent of the error of measurement of 
the test, at the exact time when he is 
tested. The error of measurement is as- 
sumed to lie entirely in the test, and 
the two half-tests are taken as equiva- 
lent random samples of all possible sets 
of items measuring the same ability. The 
assumptions implied in this first method 
of procedure may be called collectively 
the assumption of comparability. 

In all measurements involving 
reliability coefficients the assumption 
is made that the errors of measurement 
in the two half-tests are uncorrelated 
with each other and with the underlying 
ability. The assumption of comparabil- 
ity means that in addition, the two half- 
tests will have equal units of measure- 
ment (though not necessarily the same 
zero-point), equal variances (but not 
necessarily equal means), equal reliabil- 
ities, and equal correlations with any 
other measure. The assumption of com- 
parability will in general be met when- 
ever two half-tests consist of an equal 
number of equally difficult similar 
items, similarly arranged. The equali- 
ties demanded are only approximate, i.e., 
equalities within the limits of the cor- 
responding sampling errors. 

The second method of arranging 
the testing program is to give the tests 
on two separate occasions. In order to 
use this method, there must be two sepa- 
rate forms of each test, but these forms 





1. A number of the standard errors here presented were first derived by Jack W. Dunlap and the writ- 
er working together. 


A number of these were published, but several more were not. 
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need not be comparable. All the Form A 
tests are given at one time and all the 
form B tests at another. If the inter- 
val between the two testing sessions has 
been well chosen, we will then have an 
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| median fluctuation of such abilities over 


| ity of a maximum difference. 


approach to experimental independence be- | 
| The two testing periods should not come 


tween Form A and Form B of each test. 


The reliability coefficient of each test 
| should not be separated by exactly one or 


will be the correlation between Form A 
and Form B. The intercorrelation be- 
tween two tests will be some average 
‘usually the geometric mean) of two cor- 
relations: that between Form A of the 
first test and Form B of the second, and 
that between Form B of the first test 
and Form A of the second. Note that all 
correlations are between a Form A test 
and a Form B test. 
tests given at the same testing period, 
such as that between Form A of each of 
two different tests, or between Form B 

of each of two different tests, are not 
computed, since they are not experimen- 
tally independent, having been obtained 
at the same testing period. 

This second procedure also im- 
plies a definition of reliability, which 
may be called "average reliability.* The 
reliability coefficient is still the ra- 
tio of the variances of measures of the 
true ability to test scores. But the 
true ability now means, not the ability 
of the subject at the time tested (apart 
from errors of measurement in the test), 
but his average ability over any period 
long enough to include all types of 
short-time fluctuation, but short enough 
to preclude growth or decline of the 
ability. The short-time fluctuations in 
ability of the subjects have now joined 
the errors of measurement of the test, 
the true ability of a subject means his 
average true ability, and the reliabili- 
ty of a measurement includes both the re- 
liability of the test and the reliabili- 
ty of the subjects. To obtain true ex- 
perimental independence, the two testing 
periods must be separated by such an in- 
terval as to give two independent random 
samplings of the underlying abilities of 
the subjects. The average difference be- 
tween the abilities at the first and 
second testing periods must equal the 


any short period (such as a few days or 
weeks), and the probability of a zero 
difference must be equal to the probabil- 
Certain 
practical suggestions would seemin point. 
1.e., 


on the same day of the week, they 


two or any other number of weeks. If the 


| program of tests is not long, one period 


| end should 
Correlations between 


should probably come in the morning and 
the other in the afternoon. The tests 
should not be given in the same order 
(nor yet in exactly reverse order) at the 
two testing periods. At least one week- 
intervene between the two 


testing periods. If these suggestions, 


| and any others that may occur to the in- 





vestigator in connection with any par- 


| ticular testing program, are followed 

| carefully, it is fairly probable that ap- 
| proximate experimental independence will 
| be obtained. 


This second method of procedure, 


as outlined above, has several advan- 


tages over the first. In estimating re- 
liability, correcting for attenuation, 
etc., the important problems usually con- 
cern the estimation of the average true 
abilities of the subjects rather than 
their instantaneous true abilities. 
more important than this is the fact 
that with experimental independence 
achieved, the need for comparability in 
general vanishes. Form A and Form B of 
each test, as long as they measure the 
same underlying ability with uncorrelat- 
ed errors, need not be equally long, nor 
equally reliable. They need not even 
measure in the same units, and they need 
not have equal variances nor equal cor- 
relations with other measures. The er- 
rors of measurement in any two or more 
Form A tests or in any two or more Form 
B tests may be correlated without any 
harm to essential assumptions, as long 
as the errors in all Form A tests are 
uncorrelated with the errors in all Form 
B tests. In a number of cases, these 
advantages may be obtained by using the 
methods of computation appropriate to 
the procedure without experimental 


But 
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independence being present. In such are usually longer than the methods ap- 
cases the assumption of instantaneous propriate to the first procedure, but 
true ability will still be present, but they should always be used whenever com- 
the assumption of comparability will be parability cannot be either demonstrated 
avoided. These methods of computation or assumed. 





II. Estimated Correlation Functions of Comparable Tests 


The basic formula in this procedure is the Spearman-Brown Formula for esti- 
mating the reliability of a total test from the correlation between its two com- 
parable halves. The various formulas to be considered involve intercorrelations 
between total tests and reliabilities of total tests estimated from half-test cor- 
relations. In order to distinguish clearly between whole-test scores, half-test 
scores, etc., a special system of notation is advisable. This is presented in the 


table below. 
Test l Test 2 Test 3 Test 4 
"True" scores Xm Xw Xy 


Total scores Xj Xe2 X3 
Scores, Form Xj Xii Xiii 


A 
Scores, Form B XT XII XIII 


From this table it is readily seen that x, = x; + Xr» X2 = X34 + Xyyz, etc. 
; 2ry 2re 
We define r, = Pips Te = Tay ys tees and Ry = itn » Re “Tea » etc. The 
values R,, Rg, etc., are the Spearman-Brown estimates of the reliabilities of the 
total tests whose scores are X,, Xg, etc. All scores are assumed to be measured 
from their respective means as origins. 

To obtain the standard errors of functions of intercorrelation coefficients 
and Spearman-Brown reliability coefficients, we require the sampling variances of 
the two types of coefficient and the sampling covariances of all possible types of 
pairs of coefficients. The sampling variance of any measure is the square of its 
standard error, and is designated by the symbol o* with a subscript identifying the 
measure. The sampling covariance of two measures is their correlation from sample 
to sample multiplied by their two respective standard errors. It is designated by 
o with two subscripts to identify the two measures. 

The two sampling variances required are already available. 


(1 - wis)". 
of «tut", (2) 


The first is Pearson's well-known formuia for the standard error of any correlation 
coefficient computed directly from the data. The second was derived and published 
by Shen (1924). 

Of the sampling covariances required, two are also well known. These are 
the formulas for the sampling covariance of two correlation coefficients computed 
directly from the data, given first by Pearson and Filon (1898). 


1 


2 2 . 
roe = @ (Q - Tig - Tis)(2ras - Pielis) + TialisTa3)]. (3) 
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1930, A, B, C). 
in Appendix I. 
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The other sampling covariances all involve Spearman-Brown reliability coeffi- 

In order to obtain these covariances it is first necessary to note a few 
preliminary formulas, most of which have already been derived (Cureton and Dunlap, 
These derivations are repeated for the convenience of the reader 


The next sampling covariance required 
ability coefficients. 


Ri = 





Taking logarithmic differentials, 


aR, _ 


- 1 2 2 2 2 
= TisT2e + TisT23 + p Tistse (Tis + rig + Tas + Pee) 


- (PigPisTisa + TialesTaa + TisTaslae + TieTaalsa)- 


noe 
_ th ji + ri 
rig, ™ 7RQ.* \V 2 (5) 


p _ \/ i+ Ta 6) 
Tie = TI2 = 7%, \ “ (6) 





> - 2 2 , 
NO. re => ?ia(Ql - r,)(1 - rg). (7) 
1 2 2 ; 
Hopp. g* S Tiall - r;)(1 - rig). (8) 
_ 2 2 2 
ee (1 - ry) (2riaTis - TisTes - Tislas)- (9) 
if2s 2 





is that of two Spearman-Brown reli- 


2r, LT e 
PS. Re Sass © 
1 +r, L + Pe 








dr, . ar, dR cs dra dre 


+ . 











Multiplying, summing for all samples, and dividing by the number of samples, 
NOR LR; 


Substituting from (7) and simplifying, 


We require, finally, the two types of sampling covariance of a Spearman- 
Brown reliability coefficient and an intercorrelation. 
the differentials, 







aR, = (1 + r,)? . 


Ti a * P Re Te i? Be 








N 
% No, r. : Noy Tr, NOW ra Onire 





(1 + r,)(1 + rg) ri(1 + ra) re(l1 + r;) 


rile 








Nop Re = 2ria(l - R,)(1 - Re). (10) 


For the first type, taking 


2dr, 
dries = dria. 
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Multiplying, summing for all samples, and dividing by the number of samples, 










eNO, rie 


No F comes 
Rilia (1 + far” 






Substituting from (8) and simplifying, 






= ryg(1 - rig)(1 - Ri) (11) 





No 
Rilia 











For the second type, taking differentials again, 






2dr, = 
dR, = er le Gres = drgs- 









Multiplying, summing for all samples, and dividing by the number of samples, 





fNo, ras 
No, = ——_-+-—__ + 
Fas (1 + r,)* 







Substituting from (9) and simplifying, 










No = (1 - Ry) (2rielis - rialas - Tistas)- (12) 


Rilas 









The basic formulas for further derivations are (1), (2), (3), (4), (10), 
(11), and (12). In deriving any sampling variance or standard error, the system 
used consists in taking the differential or logarithmic differential of the func- 
tion, squaring, summing for all samples, dividing by the number of samples (a theo- 
retically infinite number of them), substituting from these basic formulas, and 
Simplifying. In deriving any sampling covariance, we take the differentials or 
logarithmic differentials of the two functions, multiply, sum for all samples, di- 
vide by the number of samples, substitute from the basic formulas, and simplify. 

To sum for all samples and divide by the number of samples, we simply substitute 
for each squared differential the corresponding sampling variance, and for each 
product of two differentials the corresponding sampling covariance. This system 

of derivation gives only first approximations to the sampling variances and co- 
variances desired, but closer approximations are not warranted when observed cor- 
relations obtained from the sample must be substituted in the formulas for the cor- 
responding population correlations called for (Pearson and Moul, 1927). The as- 
sumptions involved in these derivations are: 

1. That all samples are drawn from a population normally distributed witn 
respect to all the variates measured. 

2. That all samples are drawn from a population in which all the regres- 
sions are linear. 

5. That all samples are large enough so that higher powers of the sampling 
errors are small in comparison with first powers, and may be neglected. 

A further difficulty arises in interpreting the standard errors when de- 
rived. It is usual to assume that the sampling distributions of correlation func- 
tioms are normal, and to interpret their standard errors by reference to a table 
of the normal probability integral. This assumption is unsafe in many cases, even 
though we know that as the size of the sample is increased, the sampling distribu- 
tions of correlation functions approach the normal. The trouble is that for small 
samples, these sampling distributions are often far from normal, and in many cases 
the approach to normality with increase in sample size is very slow. For some of 
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the functions to be considered, a sample of 1000 is still 4 small sample. 

In spite of all the difficulties above noted, it is still true that an ap- 
proximate standard error, based on an unknown sampling distribution, is better than 
none. The following standard error formulas are therefore offered for whatever 
they may be worth. It is to be hoped that eventually their exact sampling distri- 
butions will all be found. 


nr, 


_ 


7 1 + (n - 1)r, . 





Spearman-Brown Formula for the reliability of a test n times as long as the 
half-test. MNote that if n = 2, ry = Rj. 


n(l - r:) . 


(1+ (mn - 1)ri)* | 





This formula was first given by Shen (1924). 
i riln 
sg ~ 2326 


Spearman-Brown Formula solved for n to estimate the length of a test (measured 
in terms of the half-test length as the unit) necessary to achieve a given re- 
liability. 


2 
n(l + r,) 
Ti 
This formula was first published by the writer in 1935. 


~ 
VR 
Correlation corrected for attenuation in one variate. This variate is usually a 
criterion. 


2 « z - . 
alan) «GY - oon) 


This formula was first given by Cureton and Dunlap (1950, B), in a slightly dif- 
ferent form. 


2 
Tealos 2res 2 . (2 - A 
Py = l-frig -?T es 
N Toslos 2 | (222) ( 12 is) Ri 


1-R 
-Ql- ris - Tis - P33) - (2 - Tia - ris) (245) |- 


This formula will be needed in finding the standard error of the difference between 
the correlations of two tests with the same criterion, corrected for attenuation in 
the criterion in each case. 


= 1 - \/ 1 = Tos’ 
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Index of forecasting efficiency for the case of a "true" criterion. This func- 


T 
tion is discussed by Conrad and Martin elsewhere in this issue of the J. Exp. 
Educ., and is designated Eq x by them. 


fy=s%\" /1-28,\ », (oo 
R ™ ] = (1 mo Tia) Ry ¢ 
1 \ / 


Tia 


Correlation corrected for attenuation. This formula is equivalent algebraically 


to the formula, 


4 \ rite 
’ 


whose standard error has been given by Shen in a very lengthy form. 
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a= ats) (AE) - @ - he) (253 
\ j/ 
This standard error function was derived by Cureton and Dunlap (1930, A), 
the formula as published contained an error in one of its terms, and should be 


but 


replaced by the present formula. 
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These last two formulas will be necessary in obtaining the standard error of the 
difference between two correlations corrected for attenuation, as well as in ob- 
taining the standard errors of other functions of correlations corrected for at- 
tenuation. 


Before proceeding further, we define arbitrarily two functions, A and B. 


(1 - ria) (1 “ re) + (risTae - PieTas)’. 

(ris + Tie + Tas + Toa) + 2Pistsa(TisPac + Ticlas) 
- 2rya(Tisles + TiaT2a) - 2Faa(TisTis + Teslaa)- 
Tisl2a — Tiala2sa- 


Tetrad difference. This is the usual form of this function, as employed in the 
study of Spearman's theory of two factors. 


“ “ 2 2 2 2 
=B+t (ria + Tis + Tig + Tas + Tae + Taq - 4)- 
This formula was first given by Kelley (1928, p. 49). 


t 





ee 2 : 

\/(2 - riz)(1 - rae) 
Vector correlation. This function was derived by Hotelling, who suggests that it 
be used instead of the tetrad for the purpose of testing the two-factor theory. 


A” — aB 


(1 - ri2)*(1 ie r34)” 





= (1 - Q*)* - 


This formula was first derived by Madow. Neither of these last two formulas has 
been published to date, so far as the writer is aware. 


t 


vm > TRiRaRaR. 


Tetrad of correlations corrected for attenuation. This has been proposed as a 
substitute for t in testing the theory of two factors. 
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t me = He b= a4 - Fae <~ Fae? | ° (33) 
A formula for the standard error of a tetrad of correlations corrected for attenua- 
tion has been given by Garrett and Anastasi (1952). Their formula is based on the 
wholly inadmissible assumption that the sampling correlations 5 aS and TEwyn 


are equal respectively to r,r., and Trisrse' Their formula should be superseded 


entirely by the one given above. 
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The triad. This function is equal to the proportion of the non-chance variance of 
xX, which is due to a general factor running through all three variates, if the theory 
of two factors holds, so that the three variates may be considered to be composed of 
one general factor and three specific factors. This function has been discussed at 
some length by Cureton and Dunlap (1930, C). 
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This formula was first presented by Cureton and Dunlap (1930, C). Now if 
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a 2 — L£Tie 2 a 
“2 ) (2rieTea - Tielis - Testis) + == ~ £45 ~ fay 
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Tis\ Re 
2 2 gS \] 

- (1 - Tig - Tis - Tas) |- \ 
This formula will be needed in computing the standard error of the difference between two 
triads, in order to estimate the significance of the difference between the general-factor 
saturations of tests x; and x5. Finally, we may note the value of the symmetric determinant, 


1 Piz Tis Tia 

r 1 r r ame 
A = 18 as as = B = B. (37) 

Tis Ta3 l l's4 

Tia Tea Tae 1 


III. The Case of Experimental Independence 


This case has been treated by the writer elsewhere at some length (Cureton, 
1931). No effort will be made here to summarize this work as has been done above 
for the case of comparable measures. 

There is one special case, however, which is of interest in connection with 
the paper by Conrad and Martin in this issue of the J. Exp. Educ. The index of 
forecasting efficiency for the case of a "true" criterion may be estimated by means 
of the second procedure, whether experimental independence is obtained or not. This 
index is a function of one set of test scores and two sets of criterion measures. 
If these measures are experimentally independent, the reliability of the criterion 
will be an "average reliability." If they are not, it will be an "instantaneous 
reliability.” In either case, we compute r,; = ryyz, the reliability coefficient of 
the criterion, and also rig and rtg, the correlations between each set of criterion 
measures and the test scores. Comparability is not demanded, whether experimental 
independence is present or not. The only requirement is that errors of measurement 
in the two criterion measures shall be independent of each other and of the true 
abilities of the subjects. In obtaining this necessary independence, the investi- 
gator is free to divide his criterion into halves in any way he chooses. The two 
half-criterion measures may be wholly unequal in number of items (average of 2 
judges against average of 3, e.g.), units of measurement, standard deviations, and 
correlations with the test. The index may be written in the present notation, 





Tier 
E.= 1 \J _ Fiala | (38) 
ab | 
. TiaTI2 _ = . . 
Let —"" F. Then Ew~=1 - 1 -F. Taking differentials, squaring, summing 
for all samples, and dividing by the number of samples, 
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The value of of is known (Cureton, 1931, p. 55, Formula 17). Substituting in the 
above equation and simplifying, 
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APPENDIX I. Derivations of Certain Preliminary Formulas 
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this value and simplifying, 
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Substituting from (5) and (6) and simplifying, 
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a = a ae 
On iries - 2 Tia(l ri) (1 tia). 


No = No ° 
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NO. res ~ "te de , Piatla ' 2 (rf, . is * 8h "Ts 


5 (Tiry.Ti, * Tily.Trs * TiskIatas ’ T43TI3la3)° 


Substituting from (5) and (6) and simplifying, 
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THE GENERAL NATURE AND APPLICABILITY OF INDEX NUMBERS 
FOR EDUCATION- 
by 
Douglas E. Scates 
Director of School Research 
Cincinnati, Ohio 


Contrary to the belief held by The index number 
many persons, index numbers are not re- fords an important means 
stricted to the realm of general econom- (measuring) complex varia 
ies, but are serviceable in many fields, | especially adapted to measur 
and are equally well adapted to problems in phenomena which cannot be 
which do not directly involve money in identified except in terms of 
any way. Index numbers have received a component or resultant elements.’ 
wider use in the field of education than take a common example, the concept, 
is generally recognized. Perhaps their "cost of living", is an abstraction 
varied use in educational problems is not which does not exist as a concrete 
better known because they have not been tity, and cannot be dealt with directly 
expressly treated in educational litera- as a single object. Its status depends 
ture as_a technique of general applica- upon the many variable elements which 
bility.” The procedure has been borrowed | compose it. The same thing is true with 
from economics and applied to education- regard to many of our concepts of quali 
al problems by individual workers with- ties or properties of logically complex 
out attention having been called explicit-| phenomena--i.e., those wnich comprise a 
ly to the nature of the adaptations that number of different kinds of elements. 
have been made, and to the possibilities If one desires to measure such : 
of further use, both for immediate serv- character as "cost of living", or "size" 
ice and for research. It is the purpose of school systems, or "general goodness" 

f this article to make a beginning in of school systems, one theoretically has 
this direction. It will discuss some of four courses open to him: (1) he can 
the general characteristics of index num-/ rely on a single selected characteristic 
bers,°® and refer to the uses that have which can be directly measured or count- 


been made of them in education. ed, to represent the general character; 


: 


en 








1. This article is based in part upon material in a section of a forthcoming book, The Methodology 
of Educational Research, by Carter V. Good, A. S. Barr, and Douglas E. Scates, to be published 
this spring by the D. Appleton-Century Co., New York. 

Exception should perhaps be made for two articles by Clark; his treatises are however limited to 
price index numbers. Harold F. Clark, Index Numbers in School Administration. Bulletin of the 
School of Education, Indiana University, III (January, 1927), No. 3. Bloomington, Indiana: 
Bureau of Cooperative Research, August, 1928. P. 35. 
See also, by the same author: "Index Numbers in Educational Work", Teachers College 

Record, XXX (February, 1929), 453-60. 
Frisch gives an admirable summary of the characteristics and the theoretical bases of index num- 
bers. The treatment however is restricted to price index numbers. Ragnar Frisch, "Annual Survey 
of General Economic Theory: The Problem of Index Numbers", Econometrica, IV (January, 1936), 
1-38, 

- Measurement procedures appropriate for different kinds of variables are treated by the writer in 
a forthcoming article in Psychometrika entitled, "The Essential Characteristics of Measurement." 
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is striving to index; (3) he can rely 
n personal estimates of the degree to 
wnich the characteristic or quality ex- 
ists in different situations; or (4) he 
n utilize the index number technique to | 
ne tw r more quantifiable charac- | 
ristics which are included in, or defi-| 
nitely correlated with, the general con- | 
-ept which he desires to index.° This 
procedure is usually the most satis—- 
ying, if it can be carried out properly, 
because it reflects variations in the 
factors which (normally) contribute to 
variation in the phenomenon being studied. 
It may be pointed out also that there is 
no mathematical reason why certain resul- 
tant variables could not be included also 
seemed logically desirable to do so, 
r if it had been proved by research to | 
be helpful to do so. 

Index numbers are most commonly 
thougnt of as applying to variation from 
time to time, as from one year to an- 

ther, but there is nothing in their na- 
ture that makes them more applicable to 
time series than to variation expressed 


to any other significant fac- 
In 


in relation 
tor (presumably an independent one). 
of index numbers referred to 
time, the particular times for which the 
numbers are calculated serve merely as 
designated points, expressed in terms of 
the variable "time", for the observation 
values of the sundry components. 
case of index numbers which re- 
variation from place to place, the 


tne case 


of the 
In the 


flect 


| 
| 
| 
| 
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ize some effect spiieianal locations serve as observation points, 


and relate the recorded data to the sig- 
nificant variable, "space", or "place." 
Any other general variable that is ap- 
propriate to the particular problem 
might be used just as well, in lieu of 
time or space. In fact, the place-to- 
place index numbers are really related 
more closely to "situation" than to 
"space" as a variable, for observations 
are not taken at regular intervals 
(along a geographical line), as in the 
case of time, but are rather taken at 
certain locations (usually cities) 
identifiable by common knowledge as rel- 
atively unique conglomerates. The index 
number technique is applicable wherever 
a general variable to be indexed can be 
regarded as representable by the summa- 
tion of (weighted) percentage variations 
in a number of elements, the observa- 
tions of these elements being taken at 
identifiable points. 

An index number may be looked 
upon either as a weighted average of ra- 
tios or as a ratio of weighted summa- 
tions (aggregates). Fisher® lays stress 
on the fact that it is an average of ra- 
tios, while King? is equally emphatic in 
stating that it is a ratio of aggregates, 
and Young® recognizes three different 
types. Such distinctions are scarcely 
of mathematical significance, since a 
weighted average is practically a ratio 
of aggregates; these emphases may, how- 
ever, be helpful at different times in 
seeing the nature of index numbers from 
different points of view under varying 





5. Note however that there are certain limitations or restrictions in the interpretation of index 
numbers, as pointed out by Leontief, and others. 
and the Problem of Index Numbers", Econometrica, IV (January, 1936), 39-59. 
yn the preceding page) also deals with this problem. 


"Composite Commodities 
Frisch (footnote 3, 
It is outlined later in the present paper. 


See Wassily Leontief: 


6. For definitions of index numbers, see Irving Fisher, The Making of Index Numbers, Chapter I, 


"Introduction", pp. 1-10. 
Boston: 
7. Willford I. King, 
rially pp. 46-49. 





New York: 


H. L. Rietz. Boston: 





Publications of the Pollack Foundation for Economic Research, No. l. 
Houghton Mifflin Co., 3d ed., rev., 1927, (2d ptg., December 1931), p. 538. 

Index Numbers Elucidated, Chapter III, "The Nature of Index Numbers", espe- 
Longmans, Green and Co., 1950, p. 226. 

See the comments of Allyn A. Young, p. 181, in Handbook of Mathematical Statistics, ed. by 





Houghton Mifflin Co., 1924, p. 22l. 
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circumstances. While index numbers are 
sommonly conceived of as consisting of 
sets of factors, one a set of items 
elements (p) of the general concept, 
and the other a set of weights (q) ap- 
plied to these elements, it is readily 
possible for index numbers to embody 

three or more sets of factors. This may 
be accomplished by combining two or more 
variables into some function of them, and 
using the resulting values as a single 
set of factors in the index number, or it 
may be done through adapting the formula 
accommodate additional factors.9 

One of the features of an index 

number which distinguishes it from just 
any weighted composite is that variations 
in the value of the items are conven- 
tionally expressed as per cents. These 
per cents or ratios represent the chang- 
ing values of the items referred to their 
value at some selected point (time, or 
place, or situation), designated as the 
base, for which the index number will be 
100. This use of ratios affords 4 unit 
of measure which will be comparable from 
item to item, at least within a certain 
sense. The selection of a base point (a 
given year, locality, or situation), with 
its attendant value for each factor, 
while not entirely without effect on the 
resulting comparisons, is not a crucial 
matter, and may be done arbitrarily to 
suit convenience. Various forms of base 
are sometimes used, such as an average of 
the values at several points, or a moving 
base, resulting directly in link rela- 


two 
w 
4 


to 


tives, which may subsequently be referred 


back to a single fixed base. 
Occasionally a special form of 

base is used, and this may not grow di- 

rectly out of actual values. 


Douglas E. Scates 
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Thus Ayres, | 








in his historic index number of s 
school systems .O did not use 
values as a base, but rather a set of 
theoretical values representing stand- 
ards. Again, indexes of business or 
economic activity which reflect varia- 
tions above and below normal, as repre- 
sented by Babson's charts, or "American 
Business Activity Since 1790". do not 
express the values as per cents of a 
fixed base, designated at some point of 
time, but rather as per cents of a base 
which is a function of the aggregate 
variations themselves, and which is, 
therefore, in a sense, a dependent vari- 
able. 

It is not, however, appropriate 
to regard every series of ratios as in- 
dex numbers, even though many of them 
are called such. The concept of an in- 
dex number posits a variety of elements 
which will be combined into a summation 
to represent as closely as possible a 
complex variable, and a single element 
in itself does not satisfy this condi- 
tion. Thus, the use of holding power of 
the schools,l2 or the average number of 
days of attendance for each child of 
school age 15 as an index of the effi- 
ciency of school systems, can scarcely 
be regarded as an illustration of index 
numbers. On the side of logic, it may 
be said that these elements are too sim- 
ple to constitute adequate representa- 
tions of school efficiency in any gener 
al sense, however satisfactory they may 
be for certain special purposes. An in- 
dex number presumes to be generally rep- 
resentative unless explicitly limited. 
On the technical side, it may be said 
that such series are simply values 
turned into per cents. Young aptly 





9. See for example: J. K. Wisniewski, "Extension of Fisher's Formula Number 
Journal of the American Statistical Association, XXVI (March, 1931), 62-65. 
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New York: Russell Sage Founda- 





Variables." 
Series No. 173. 
10. Leonard P. Ayres, An Index Number for State School Systems. 
tion, 1920. P. 70. 
ll. Published by the Cleveland Trust Company; several editions at various dates, February 1932 and 
later. 
12. Ernest C. Witham, "Index of Holding Power", American Educational Digest, 46 (August, 1927), 


548-51. 





Ernest C. Witham, "Public School Progress of the States", American School Board Journal, 


75 (October, 1927), 37-39. 


15. A. B. Sias, The Financing of a State School System, Doctor's thesis, Stanford University, Cali- 








fornia, 1926. 
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terms them "relatives", or "relative nus- One occasionally finds the term 
bers." They constitute single elements "index number" used where practically 
wiich, along with a number of other dif- none of the characteristics of a conven- 
erent elements, might be combined to tional index number are present. For 
form an index number. Here the concept | example, an index number of teacher 
f an index number as an average of sev- training} and an index number of first 
eral such series of ratios is serviceable.| year university work!’ have been report- 
Again, the use of natural aggre- | ed, neither of which was expressed in 


gates, even when expressed as percentage per cents, nor prepared by combining da 
variation from time to time or from place | ta for a number of separate elements. 
to place, lacks certain of the character-| Such use of the term for special indexes 
isties of an index number. Thus, one may | developed for particular purposes and 


salculate trends showing the change dur- representing almost any kind of function 
ing the past ten or twenty yearsin school | must be regarded as colloquial, and un- 
enrollment, in school building construc- desirable; it cannot help being mislead- 


ton,4 or in school expenditures,5 using | ing, both as to the nature of what is 
totals for states or for the nation. It | presented, and as to the general nature 


Gua 


be argued that as totals these repre-| of index numbers. Burns, in calculating 








ma 
sent summations of many component vari- his index of transportation need 28 prop- 
ables--more, in fact, than could be se- erly refrains from calling it an index 
cured for constructing an index number, | number. 
and they are weighted exactly right. Such | A third characteristic of the 
contentions are probably fair, but the | typical index number is that it is a sam- 
index number technique is a procedure for | ple. Probably this characteristic would 
weighting and combining various selected | not be set up as a requirement, but it 
elements, and the employing of natural | is at least typical. An index number is 
totals precludes the application of these| expected to represent the fluctuations 
processes. The matter at issue is, of of a general category, or of a large 
course, one of definition, and not of | class of elements, through being calcu- 
value. The availability of complete to- lated from a selected group of elements 
tals which meet the conditions of the which are a representative sample of the 
general concept to be indexed is extreme-| entire class. In other words, an index 
ly fortunate, and such figures are su- number is expected to be the basis of a 
perior for most purposes to any that an generalization about a larger group of 
index number could yield. When expressed| variables than those which are actually 
as per cents, they should, however, be included in the calculation.19 

lled "ratios", or "relatives." It is interesting to take notice 











14. See for illustration, "The Nation's School Building Needs", Research Bulletin of the National Ed- 
acation Association, XIII (January, 1935), No. 1. Figure II on p. 7, and Figure VII on p. 27. 
15. John K. Norton, The Ability of the States to Support Education. Washington, D. C.: The Nation- 
al Education Association, 1926. P. 88. 
16. W. R. Burgess, "The Education of Teachers in Fourteen States", Journal of Educational Research 
1921), 161-72. 























(March, 
W. R. Burgess, "The Rate of Progress in Teacher Preparation", Journal of Educational 
Research, IV (October, 1921), 180-86. 
17. Douglas E, Scates, "A Study of High-School and First-Year University Grades." School Review, 
(March, 1924), 182-192. 
18. Robert Leo Burns, Measurement of the Need for Transporting Pupils. Contributions to Education, 
No. £89, N. Y.: Teachers College, Columbia University, 1927. P. 61. 


19. The restrictions earlier called attention to (see footnote 5, page 266) concern primarily the 
interpretation of index numbers, and should be borne in mind whenever they are used. 
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* the several different ways in which 
this sampling may occur. In the first 
place, the general class may be sampled, 
just referred to, by selecting cer- 


Thus, not alt | 


tain constituent elements. 
kinds of articles sold at wholesale, not 
all elements of size or merit of school 
systems, would or could be included in 

an index number of wholesale prices or of 
excellence of school systems. Certain 
ynes would be chosen to represent the en- 
tire group. On the other hand, an index 
number of school bonds issued for build- 
ing construction deals with a concept 
wnich is relatively homogeneous, and, 
after an appropriate definition for the 
class has been worked out, there would 
not appear to be any problems of sampling 
the constituents of the class, since 
(presumably) whatever sub-classes there 
were would behave approximately alike. In 
similar manner, one could construct an 
index number for other general variables 


which were either simple (with respect to 


sub-classes of significant elements) or 


Douglas E. Scates 


| ity). 





which were reasonably complex but com- 
pletely represented, without encounter- 
ing important problems of sampling the 
constituents. As an example of complete- 
ly representing a complex variable, a 
manufacturing company might make an 
analysis at the beginning of each year,or 
each quarter, of the orders on hand and 
the detailed operations required on each 
of its machines, to prepare an index num- 
ber of its production load for the next 
period ahead. 

A second way in which sampling 
may occur is with reference to the field. 
Index numbers of commodity prices common- 
ly sample both the class and the field; 
that is, they include only selected items, 
and they price these at selected locali- 
ties throughout the country. An index 
number of a simple case (as school bonds), 
would normally sample only the field; for 
example, prices would be secured from 
various localities, each price probably 
being weighted by the number of bonds sold 





| in terms of a4 
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(either at that price, or in that local- 
Field samples may, of course, be 
taken with respect to any significant 
variable in the field--time, space, 
specified condition. 

Data which are complete with 


reference to geographic distribution, 


or 


| such for example as index numbers for 


all of the forty-eight states (varying 
from place to place), may be regarded as 
field samples with respect to some other 
variable, as special condition, or pos- 
sibly time. Perhaps it is worth call- 
ing attention to the fact that the sam- 
pling will not be done on the independ- 
ent or reference variable which is con- 
sidered of major significance; that is, 
sampling prices of commodities by taking 
selected localities throughout the na- 
tion is done to secure a representation 
of the prices generally prevailing at 
that time for an index number whose chief 
independent variable is time. (While 
index numbers are reported for prices at 
different localities these are not place- 
to-place index numbers, since their per- 
cent variation is expressed on a time 
and not a location base.) Similarly, 
sampling with respect to time is appro- 
priate when the chief independent vari- 
able is something else. For example, 
time sampling of children's behavior in 
observational studies is appropriate 
when variation is not to be related to 
time but to "different pupils" or "dif- 
ferent situations" as the independent 
reference variable. 

Attention was earlier called to 
the value and appropriateness of the in- 
dex number technique generally for rep- 
resenting complex variation. Since mul- 
tiple regression equations also provide 
a means of indexing complex variation, 
summation of weighted com- 
ponents, it may be desirable to make cer- 
tain comparisons of the two techniques. 
Some of the distinctions exist with re- 
gard to form; perhaps the most important 
ones exist with regard to use. For 
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example, the multiple regression equa- 
tion requires a suitable quantitative 
criterion, whereas this may be the very 
thing that one is seeking.*9 It is in 
fact quite possible that an index number 
representing some complex phenomenon 
might be constructed to serve as the de- 
pendent variable for the purpose of solv- 
ing a multiple regression equation. In 
addition to the requirement of a criteri- 
on, the multiple correlation technique, 
as commonly used, requires linear rela- 
tionships; it is not well adapted to a 
large number of variables; and the system 
of weighting is relatively simple. In the 
case of index mambers, on the other hand, 
there is no requirement of a mathematical 
criterion, there is no assumption as to 
the mathematical form of relationships 
between the values of the elements, the 
technique may be readily extended to in- 
clude hundreds of component variables, 
and the weighting may be built upin prac- 
tically any form desired, even varying 
the form with the particular element 
this is appropriate. 

Interesting perhaps more from the 
theoretical standpoint than the practi- 
cal, is the fact that variation from ob- 
servation to observation is not essen- 
tial to an index number, while it is a 
requirement for correlation. That is, 
index numbers might conceivably continue 
to be 100 for several successive sets of 
observations; if correlation were at- 
tempted for these values, the coefficient 
would be either zero or indeterminate 
(0/0), for the entire scattergram would 
be concentrated along one axis or at a 
Single point. That is, a series would 
consist of constants. Of somewhat more 
practical significance is the fact that 
an index number requires only two sets of 
servations (that is, two observations 
for each variable or element), so that an 
index number becomes possible as soon as 
a second observation point has been 
reached. Simple correlation can be cal- 
culated between two observations for each 


if 





variable, but it is either 1.00, -1.00, 
zero, or indeterminate, and the partial 
correlations are chaotic and meaningless 
in most cases. In both techniques dif- 
ferences in means from one variable to 
another are immaterial; differences be- 
tween the dispersion of one variable and 
that of another have the effect of weight- 
ing in index numbers, while in correla- 
tion they are of no effect unless the 
variability of a certain element is ab- 
normally restricted, as by selection, in 
which case the correlation is lowered 
and thus ultimately the weight of that 
element is reduced, as in the index nun- 
ber. In both techniques the separate 
(weighted) variables are summed, though 
in certain of the more elaborate (and as 
yet little used) forms of the regression 
equation, product terms appear. 

As index numbers are commonly 
used, they can be more safely interpret- 
ed in terms of cause and effect than can 
the majority of correlation coefficients. 
This fact does not grow directly out of 
the nature of the mathematical relations 
so much as it does out of the selection 
of variables which is usually made. In 
most applications of index numbers, ele- 
ments are included which have a known 
and demonstrable relation to the general 
category indexed. That is, for example, 
an index number of building costs will 
consist of such factors as labor and 
building materials--variables which ob- 
viously contribute rather directly to 
fluctuations in building costs. Corre- 
lation coefficients, on the other hand, 
are commonly calculated to see whether 
any relationship between one variable 
and another exists, and if a mathemati- 
cal relationship is found, the structur- 
al form of this relationship still has 
to be ascertained before anything can be 
said with regard to cause and effect. In 
this respect, index number and correla- 
tion techniques frequently proceed in 
opposite directions. 

In thus differentiating between 





20. While Hotelling has developed an ingenious extension of the usual conception of criteria, or de- 
pendent variables, his method still does not make them universally available. 


See Harold Hotel- 


ling, "The Most Predictable Criterion", Journal of Educational Psychology, XXVI (February, 1935) 
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the requirements of the two techniques, 
there is no intention of implying that an 
index number is better than a multiple 
regression equation in any particular 
case. It is simply applicable to certain 
kinds of problems under a wider range of 
conditions. If an independent criterion 
of the complex (dependent) variable were 
available, one could determine by corre- 
lation whether an index number or 4 re- 
gression equation provided the best meth- 
od of approximating it (assuming the nec- 
essary variation). It will also be rec- 
ognized that correlation techniques have 
their own unique sphere of service, into 
which index numbers do not intrude, such 
for example as determining relative con- 
tributions. 

At this point we may turn atten- 
tion to certain considerations with re- 
gard to the actual preparation and useof 
index numbers, It would seem that there 
are six phases of the work which should 
be given attention. The first of these, 
that of selecting elements or factors to 
be included, has already been referred 
to. Assuming that the general variable 
to be indexed is complex, one's first 
thought will be to use elements which are 
definitely representative of this gener- 
al category or class. This is partly an 
abstract matter, partly an empirical one. 
Questions of logic, of interpretation, 
and of definition will be involved; also, 
one's acquaintance with the factors which 
are available for measurement will in- 
fluence his decision. It is somewhat 
common for one's judgment to be too heav- 
ily influenced by this second group of 
considerations, especially in attempts to 
measure status with respect to abstract 
concepts, such as "merit." It should be 
borne in mind that while multiple corre- 
lation has its criterion variable, (fre- 
quently resting very heavily upon judg- 
ment), the index number depends for its 
validity directly upon the component 
variables which are included in it, and 








21. A question may arise as to whether to weight by 
Certain formlas use a combination of the two. 


importance, 


these are usually selected on the basis 
of individual or group judgment. 

There are situations in which 
the selection of items to be included in 
the index number is more a matter of 
sampling of the category than it is a 
matter of judgment as to what factors 
are properly embraced by the general 
concept. For example, in the field of 
wholesale prices, anything sold at whole- 
Sale might be included, and selectionis 
largely a matter of sampling of consti- 
tuents, as previously discussed. Ifa 
question concerning the importance of a 
particular item were raised, such a ques- 
tion would be answered by pointing out 
that the commodity in question was 
weighted in accordance with its volume 
of turnover. Where no serious question 
enters as to what properly constitutes 
an element in the general concept repre- 
sented, the matter of judgment is not 
so prominent. It is of course recog- 
nized that in any field, problems of 
definition will occur. 

Proper weighting is a second 
matter to be considered in constructing 
index numbers. In the field of simple 
price indexes, one may weight by the 
quantity sold, and there can be little 
argument .*1 In indexes for more varie- 
gated or more subjective characteris- 
tics, such as general price level, gen- 
eral business activity, or the efficien- 
cy of school systems, the problem of 
weighting becomes more uncertain, and 
frequently it must rest largely on judg- 
ment after careful analysis. In any 
class (general concept) having widely 
differing kinds of elements, which are 
variously received or used (purchased) 
by different persons, weighting can rep- 
resent only a sort of average of field 
conditions, and will not usually be ac- 
curate for any particular case (person, 
or group of persons). For example, with 
reference to "cost of living", families 
on different economic levels, or on the 





the quantities in the base or in the given year. 
Exact weighting may not be of large practical 


See Fisher, op. cit. (footnote 6, page 266), pp. 528, 346-48, 452, 447-49. 
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same economic level but with different 
tastes, spend very different portions of 
their budget for elements which enter 
into the cost of living index, so that 
any weighting used will probably not fit | 
any large number of families.*© The same | 
complication may arise in other applica- 
tions of index numbers. Further, when 
reciprocals of index numbers are used, as | 
for the purpose of indicating the gener- | 
al purchasing power of money, the rela- 
tive weighting is changed, after the man- 
ner that the weighting of items in the 
harmonic mean differs from the weighting 
of the same items in an ordinary arith- 
metic mean, The matter of weighting, 
therefore, presents complications which 
are not readily solved. 

Reference may be made in passing 
to a prevalent misconception that when 
one uses data as collected, the data are 
not weighted. Such data have, however, 

a@ natural weighting which is just as real 
(and may be just as wrong for a certain 
purpose) as any artificial weighting that 
may be assigned. In the case of index 
numbers, the natural weighting will arise 
from the differences in the variability 
of elements. On the other hand, some 
workers, when combining series to form a 
composite, mechanically proceed to re- 
move this natural weighting and reduce 
all of the variables in their studies to 
equal weighting, presumably with the 





| makes his work "objective." 


thought that they have thereby relieved 
themselves of responsibility for judg- 
ment, and have made their work perfect- 
ly objective. While this is not likely 
to be dome in the case of the elements 
of index numbers, where a formula is 
usually followed, one may, in the same 
frame of mind, omit values for the nomi- 
nal or assigned weights of the elements, 
believing that a uniform weight of unity 
It should 
be made explicit that equal weighting is 
likely to be much less justified than 
approximate, arbitrary weighting. 
psychological fields the matter of 
weighting is primarily one of judgment, 
barring the extensive research which 
might remove judgment another step, and 
one does not improve his work by failing 
to exercise judgment where it is called 
for. 


In 


Definition of detailed concepts 
constitutes a third important aspect of 
gathering data for index numbers. Ele- 
mental classes, and measures of then, 
must be uniformly defined. While obvi- 
ous, this matter is frequently not given 
appropriate attention. For example, 
such an apparently simple thing as "one 
day of attendance" varies a great deal 
in its concept from one school system to 
another. Phillips gives an illuminating 
discussion of this difficulty25 As in 
any area of critical work, the perception 





22. For an index number to be generally interpretable as having a precise significance for each of 
the various situations (persons or groups on different economic levels, or situations varying in 
any other factor that is related to weights) calls for the assumption that all of the weights 
will vary in the same proportion from one situation to another (e.g., from one income group to 


another). 


This obviously is not likely to occur. The condition can however be satisfied if, for 


each magnitude (class interval) of price ratios, the average of weights is constant from situa- 
tion to situation; or if, for each magnitude (class interval) of weights, the average of price 


ratios is constant from situation to situation. 


That is, the correlation ratio between weights 


and situations with price ratio constant, must equal zero; or the correlation ratio between 


price ratios and situations, with weights constant, must equal zero. 


Other conditions which are 


theoretically satisfactory are that all of the weights for the various commodities should be 
equal, that all of the price ratios for the various commodities should be equal, or that the 


weights for the various commodities should not vary from situation to situation. 


Perhaps other 


25. 


conditions which will be satisfactory can be found. These statements refer of course to the 
true weights in the field; there are as many weights for each commodity as there are situations. 
The statements are made in terms of economic concepts because these are more readily followed. 
Application to other fields may be readily made. 

Frank M. Phillips, "Educational Rank of the States, 1950", Section II, "Uniform Definitions, Rec- 
ords, and Reports", American School Board Journal, 84 (March and April, 1932). Also in a pamph- 





let of same title, published by the author, Washington, D. C., 1932, pp. 25-40. 
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of difficulties, (when not exaggerated), 
is in itself an indication of some skill 
and maturity, and common units and 
traits (variables) are frequently not 
well defined principally because workers 
have not developed a sufficient familiar- 
ity with the field in which they are con- 
ducting research. 

A fourth matter for attention is 
the sampling of the source-field. Per- 
haps this has been discussed sufficient- 
ly in connection with the general nature 
of index numbers. It is similar to the 
problem of sampling in any research work. 
It may be that in some instances complete 
field data can be secured. In any large 
study, however, this is impossible. In 
the case of prices, one cannot gather da- 
ta on all of the prices of any single 
commodity in every city and village in 
the United States, so one resorts to 
sampling of the field, and gathers data 
from a certain number of cities which he 
believes will also represent other cit- 
ies, 24 King™ contends that the problem of 
sampling (referring probably to sampling 
both of the constituents of the general 
variable and of the source-field) is the 
only real problem of index numbers. While 
other writers recognize the importance of 
sampling they do not concur in such an 
extreme emphasis.26 

Fifth, we have the form of index 
number to be used. This form will in 





part control, or be controlled by, the 
weighting to be given the various ele- 
ments. It will also determine cer- 
tain other characteristics of the re- 
sulting values. Fisher®’gives the most 
extended treatment of formulas, though 
he does not exhaust the possibilities. 
He discusses six types of averages and 
six types of weighting (p. 351), and ana- 
lyzes the resulting index numbers on the 
basis of several criteria. He concludes 
that his formula No. 358 is the "ideal" 
one (see his pp. 360 and 493), but that 
formula No. 2153 is more easily calcu- 
lated, and is practically as good 
(pp. 361 and 494). His formula No. 53 
(estimated to be correct within one per 
cent, pp. 362 and 494) is both rapid and 
Simple to explain to non-technical work- 
ers. It is the form used by the U. S&S. 
Bureau of Labor Statistics in calculat- 
ing the index numbers of wholesale prices, 
retail prices, and cost of living. It 
will be found satisfactory for most pur- 
poses. His formula No. 1, which is a 
Simple average, is the one that has been 
generally used in educational studies; 
Fisher says of it that "It should not be 
used under any circumstances, being al- 
ways biased and usually freakish as 
well,*28 

Other convenient sources of index 
number formulas are Kelley £9 Young © and 
most books on statistical methods in 





24. A description of the methods by which the United States Bureau of Labor Statistics gathers data 


from the field for its index numbers is given in the following bulletin: 
and Computing Statistical Information of the Bureau of Labor Statistics. 


Methods of Procuring 
Bulletin of the JU. S. 








Bureau of Labor Statistics, No. 526. 


These methods have recently been modified, as described in: 


Washington, D. C., March, 1923. 


P. 54. 
Faith M, Williams, Margaret H. Hogg, 


and Evan Clague, "Revision of Index of Cost of Goods Purchased by Wage Earners and Lower-Salaried 
Workers", Monthly Labor Review, XLI (September, 1955), 819-57. 





25. 


King, op. cit. (footnote 35, page 266), p. 49. His position is further developed in Chapter IV, 


"Sampling as Related to Index Numbers", pp. 59-77, and Chapter VII, "Percentages of Error Found 


in Certain Price Indices", pp. 143-88. 
26. 
27. 
28. 
29. 


Fisher, op. cit. (footnote 6, page 266). 
Ibid., pp. 361, 466; see also pp. 64-6. 





Macmillan Co., 1923. pp. 590. 
ly Fisher's formulas Nos. 553, 2153, and 55. 


H. L. Rietz, editor. Boston: 


Truman L. Kelley, Statistical Method, Chapter XIII, "Index Numbers", pp. 551-47. 
On pp. 344-45, Kelley's formulas Nos. 15, 10, 12 are respective- 


See, for example, Fisher, op. cit., (footnote 6, page 266), pp. 556-40, and 524-25. 


New York: The 


. Allyn A. Young, "Index Numbers", Chapter XII, pp. 181-194, in Handbook of Mathematical Statistics, 
Houghton Mifflin Co., 1924. 





P. 221. Young's formulas Hos. 1l, 


15, and 10 are respectively Fisher's formulas Nos. 555, 2155, and 55. 
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274 eras 
economics. Frisch's reviewlincludes able is the individual buying habits of 
many of them. A very practical and com- persons, families, or groups, which ex- 


prehensive treatment is given by Croxton 
and Cowden.°© One should also consult is- 
sues of the Journal of the American Sta- 
tistical Association for a number of 
years back, for many practical and theo- 
retical articles. Most of the useful 
formulas do not present mathematical dif- 
ficulties, though they may appear to do 
so to the tyro. They are generally stat- 
ed in terms which grow out of the econom 
ic field; e.g., p stands for the price of 
a commodity, and q stands for the quan- 
tity of this item that was sold. To 
translate these symbols into non-finan- 
cial terms, p would stand for the value 
observed for any particular elemental 
variable at any particular time, place, 
or circumstance, and q would be the 
weight assigned to this element. 

Sixth, and finally, we should 

the matter of interpretation. 
always a critical step in re- 
and particularly so when 4 mathe- 
matical formula of some complexity has 
been employed. The more refined the 
mathematical reasoning by which the form- 
ula has been derived, the more careful 
one must be in applying and interpreting 
it, for many assumptions are likely to 
have been made, either expressly or in- 
pliedly--mostly the latter. Upon careful 
examination, the interpretation of index 
numbers presents more difficulties than 
are at first apparent. The difficulties 
lie in part in the weighting, andin part 
in the philosophy of value. 

The chief source of the difficul- 
ty, so far as weighting is concerned, 
arises from the fact that there is actual- 
ly a fourth conditioning variable operat- 
ing which ts generally omitted from the 
calculation, but which cannot logically 
be overlooked in the interpretation. Thus, 
to illustrate in terms of an economic in- 
dex number, we commonly recognize the 
three variables time (or place), price, 
and quantity (weight). The fourth vari- 


mention 
This is 
search, 








hibit varying patterns of weights for 
the different elements or commodities, 
and thus make a single or constant pat- 
tern of weights unrepresentative of 
their particular situation. Accordingly, 
one could not select a city in which to 
live by using a cost of living index, 
unless his spending habits conformed 
closely to the weights used in the index 
number. One stands the chance of his 
weighted index number representing no 
actual group or situation at all--which, 
of course, is true of nearly all aver- 
ages, but particularly of means. 

To illustrate the weighting dif- 
ficulty in terms of appraisal, we may 
assume a hypothetical case of a prospec- 
tive student using index number ratings 
on various colleges as a basis for his 
selection. He is interested, let us as- 
sume, in a school in which a broad pro- 
gram of extra-curricular activities is 
emphasized. Index numbers which use an 
average (constant) set of weights for 
all colleges in the country would scarce- 
ly afford a satisfactory comparison for 
his purpose. They would reflect no 
higher rating for an institution giving 
a great deal of attention in its educa- 
tional program to extra-curricular ac- 
tivities than for another institution 
which gave the same quality of work but 
allowed only for a very small amount of 
the student's time in this area, because 
the quality in both cases (which we as- 
sumed to be constant) would be weighted 
by the same weight. 

The second difficulty, as stated, 
lies in the philosophy of value. Per- 
haps "value received" represents a fifth 
variable--and one which also is omitted 
from the calculation. Thus, in the 
field of economics, as prices vary in 
different ways for different commodities, 
the quantities purchased change. A per- 
son may purchase X units of commodity A, 
and Y units of commodity B when these 


























Sl. Frisch, op. cit. (footnote 3, page 265). 


52. Frederick, E. Croxton and Dudley J. Cowden, Practical Business Statistics, Chapter XVII, "The 





Construction of Index Numbers", pp. 362-77, and Chapter XVIII, "Some Current Indexes", pp. 378- 
Prentice-Hall, Inc., 1934. 
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are at a certain price; but if the price 
of commodity A increases more rapidly 
than that of B, one may purchase fewer 
units of A, and more units of B than be- 
fore. Assuming that his total expendi- 
tures have not been increased, his total 
satisfaction may be greater than it 
would be if he continued to purchase the 
original quantities of commodities A and 
B. In other words, as prices change, 
quantities of different items purchased 
also shift, in order that one may get the 
maximum amount of satisfaction from his 
spendable income. It is thus possible, 
under changing conditions, for one to re- 
ceive as great satisfaction as formerly 
with an expenditure that has not changed 
as much (or as little) as a mechanical 
average of price ratios, in their origi- 
nal quantities, would call for. Much de- 
pends upon the flexibility of the indi- 
vidual's scheme of values, and the com- 
pensations that may be made. The analo- 
gous application to values in appraisal 
should be clear; a deficiency in one as- 
pect may be more than compensated for by 
some special perfection in another as- 
pect, giving a joint result, or pattern, 
which is superior to the sum of the in- 
dividual ratings of the two aspects, 
weighted with established weights. Also, 
as in the case of economics, the flexi- 
bility or possibility of such compensa- 
tions will depend somewhat upon the in- 
dividual who is concerned (the user). A 
fixed scheme of arriving at an index nun- 
ber value may prevent it from reflecting 
such compensations, and hence from repre- 
senting accurately the variations in 








true value which exist. 

Such problems of interpretation 
are not important where the general con- 
cept or class being indexed is simple, 
or relatively homogeneous. Here the in- 
dex number may be interpreted as repre- 
senting percentage variation, and as 4ap- 
plying generally. Even when the con- 
cept or class is heterogeneous, one can 
make the same interpretation if he is 
willing to do so "on the average"; with- 
out interpreting his result as being ap- 
plicable to any particular or single 
case. To withhold inferences from in- 
dividual cases is difficult, unless one 
thinks strictly in terms of totals-- 
such, for example, as the general (aver- 
age) level of education in different 
states, as wholes, and without any sig- 
nificance for individual cities in the 
states. Even here the problem of sub- 
limation of value through substitutions 
is present. Such considerations lead 
Leontief™ to refer to index numbers in 
general as "statistical approximations 
to a theoretically indeterminate con- 
cept." 

In order to illustrate more con- 
cretely the applicability of index nun- 
bers to educational problems, examples 
of their use will be mentioned. We may 
begin by referring to those which deal 
with economic aspects of education. In 
this field, H. F. Clark, and certain as- 
sociates, prepared three series of price 
indexes, covering respectively the cost 
of school supplies 4 the price (interest 
rate) for school bonds$* and the cost of 
school buildings.5® The latter field has 





33. Leontief, op. cit. (footnote 5, page 266), p. 45. 
54. Harold F. Clark and John Guy Fowlkes, "Index Numbers for School Supply Prices." 


Appeared month- 


ly in the Nation's Schools from September, 1928 (Vol. II) to December, 1929 (Vol. IV) and was 
then combined with the index of school building costs, being discontinued with the March, 1930 





issue (Vol. V). 


35. Harold F. Clark, "Index of School-Bond Prices." 


Appeared monthly in the American School Board 





Journal from January, 1928 (Vol. 76) to November, 1931 (Vol. 83). 
56. Harold F. Clark, This series began as "School Building Cost Index" in American Education Digest, 


48 (December, 1928), 28. 





Continued as "School Building Index" in School Executives Magazine 





from January to August, 1929 (Vol. 48); continued with Oscar K. Buros, joint author, as "Index 
of School Building Prices", in the same magazine, September to December, 1929 (Vol. 49); then 
combined with the index of school supply prices in the Nation's Schools from January to March, 


1930 (Vol. v). 
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also been worked on by others ,°7 the most 
yutstanding study being that by Burgess£8 
An index number for the cost of equipping 
new buildings was prepared by Loomis.59 
Davis employed a sort of index number to 
study the increasing cost of operating 
school buildings. 40 

bers have been constructed especially for 
teachers,*! to reflect variationsin costs 
from year to year or from place to place. 





Several cost-of-living index nun- | 


in the distribution of state school 
money .*4 

In addition to such special in- 
dex numbers, use has been made of the 
general economic index numbers. For 
example, index numbers of the cost of 
living have been used to show variations 
in the purchasing power of teachers' 
salaries.46 They have also been used 
rather widely to account in part for the 
increasing costs of education since 1900; 





The report of the Committee on the Eco- 

nomic Status of the Teacher*°reviews four 
earlier index numbers 
Warne, Boothe, Eells, 
then constructs a new one, 
1928-34. 
of living of teachers 
of New York. 
Ohio, and the index number was 
as one of the factors recommended for use 


in fact, almost half of the increase in 
expenditures for education from 1900 to 
1930 has been attributed to the decreased 
purchasing power of the dollar during 
this period.47 Another type of compari- 
son that is possible with economic index 
numbers is that between the increase in 
expenditures for education and the in- 
creases in industrial production and 
business activity, as shown by certain 


(those of McKay and 
and Butsch), and 

for the years 
Harry” made a study of the cost 
in different parts 
This study was repeated in 
included 








57. 


59. 


41. 


45. 


44. 


45. 


46. 


47. 


. Harvey H. 
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T. C. Holy and W. E. Arnold, "School Building Expenditures in Relation to School Building 
Costs", American School Board Journal, 85 (July, 1952), 41-42. A. C. Monahan, "Notes", Ameri- 
can School Board Journal, 69 (October, 1924), 64; 71 (September, 1924), 65; and 83 (August, 
1931), 74. 
Randolph W. Burgess, Trends of School Costs. New York: Russell Sage Foundation, 1920. P. 142. 
Chapter V traces building costs from 1841 to 1920, in terms of index numbers. 

Also given in part in the following: "Eighty-year Fluctuations in the Cost of American 
School Buildings", American School Board Journal, 62 (January, 1921) 57-8; also in Proceedings 
and Addresses of the National Education Association, LVIII (1920), 329-30; also in School Life, 
5 (August 15, 1920), 11-12. 
Arthur K. Loomis, The Technique of Estimating School Equipment Costs. 
tion, No. 208. New York: Teachers College, Columbia University, 1926. P. 112. 


Equipment Costs, Teachers College, Columbia University, 1926. P. 259. 
Davis, "An Index of School Plant Operation Costs", American School Board Journal, 73 


(July, 1926), 53. 
T. C. Holy, "Cost of Living Indexes for Teachers' Salaries", Educational Research Bulletin, XII 


(February 8, 1933), 42-45. 

"The Teacher's Economic Position", Research Bulletin of the National Education Association, XIII 
(September, 1955), Chapter VII, pp. 222-42. See also Circular No. 1, January, 1953, of the Edu- 
cational Research Service of the National Education Association, on "Estimating Changes in 
Teachers' Cost of Living." i 
David P. Harry, Cost of Living of Teachers in the State of New York. Contribution to Education, : 
No. 520. New York: Teachers College, Columbia University, 1928. P. 184. 

Equalizing Educational Opportunity in Ohio: a preliminary report of a survey of state and local 
support of public schools in Ohio, prepared under the direction of Paul R. Mort. The Ohio School 
Survey Commission, Columbus, Ohio, November 1, 1932. See pp. 39-40 and 147-49. 

For a description of various economic index numbers which are available, and their sources, see 
Barr, Good, and Scates, op. cit., (footnote 1, on page 265), pp. 445-48. 

F. K. Shuttleworth, "Dollar and Real Incomes of Public School Teachers and of Wage Workers, 1889- 
1890 to 1954-55", Educational Administration and Supervision, XXI (February, 1955), pp. 81-96. 
"Facts on School Costs", Research Bulletin of the National Education Association, X (November, 
1952), 225-24. A criticism of this conclusion has been given by Nelson B. Henry, "Index Numbers 
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index numbers. 

A second use that has been made 
of index numbers in education has been 
for appraisal. This group of index num- 
bers is largely, though not in all cases 
entirely, divorced from direct connec- 
tion with money. Most ambitious of these 
applications have been the attempts to 
rate the educational activities of the 
various states. Reference has already 
been made to the index number prepared by 
Ayres, which has been widely discussed 
among school men. Phillips* extended 
Ayres' index and modified it. Schrammel 
prepared a third index number for the 
states° In addition to these state in- 
dex numbers, there have been at least 
three others prepared for cities and four 
for counties. This whole field has been 
excellently reviewed and summarized by 
the Research Division of the National Ed- 
ucation Association»! and a bibliography 
of 47 references compiled. The same pub- 
lication contains data for the states on 
five factors selected by the Research Di- 
vision, which, however, refrains from 
combining these into a single index be- 
cause of not desiring to decide on the 
relative weighting of each factor. An in- 
dex number for higher education in the 
various states, based on eight factors, 
has been prepared by Chamberlain and 
Meece.-© Private and public education are 









considered both separately and jointly. 

Other studies may be found by 
consulting the topics, "Index Numbers’, 
and "Cost and Standard of Living", in 
the Education Index. Lundberg” lists 
three index numbers that have been pre- 
pared in the field of Sociology to re- 
flect quantitative changes in condi- 
tions. 





Perhaps it is appropriate in 
closing this paper to refer to another 
use of index numbers which, so far as 
the writer is aware, has not yet been 
made. This use is for the purpose of 
combining detailed judgments, or ratings, 
such as are made when a score card is 
employed. Score cards represent an 
elaborate form of rating scale in which 
(typically) a large number of aspects of 
an object to be rated are listed for 
separate attention, each being allotted 
a&@ maximum number of points which may be 
granted. These points, as awarded in 
rating, are added to arrive at a "score" 
for any object. The score is interpret- 
ed either through comparison with the 
score for other objects rated in the 
Same study, or by comparison with cer- 
tain standard values. It would be readi- 
ly possible to modify this technique 
slightly so that an index number would 
result. This would require expressing 
ratings for each aspect or element as 












48. Ayres, op. cit., (footnotel0, page 267). 


49. Frank M. Phillips, "Educational Rank of the States, 1930", American School Board Journal, Vol. 


84, February through May. 
D. C.) Earlier articles: 





(Also available as a forty-page reprint, from the author, Washington, 
"Educational Rank of the States, 1924", American School Board Journal, 





72 (April, 1926), 47, 141; and "Educational Ranking of the States by Two Methods", American 


School Board Journal, 69 (December, 1924), 47-49. 





This early series is available as a thirty- 


two-page publication for the Bruce Publishing Co., Milwaukee, Wisconsin (1925). 


50. Henry E. Schrammel, The Organization of State Departments of Education. 


"Ranking of States Ac- 





cording to Educational Achievements", Chapter IX, pp. 115-34. Bureau of Educational Research 


Monographs, No. 6, Columbus, Ohio: 


Ohio State University Press, 1926. 


51. "Estimating State School Efficiency", Research Bulletin of the National Education Association, X 





(May, 1932), No. 3, 104-112. 


See also, for some additional references: 
New York: 


107. Contributions to Education, 242. 
P. 142, 





52. 


Frank L. Shaw, State School Reports, pp. 103- 
Teachers College, Columbia University, 1926, 





Leo M. Chamberlain and L. E. Meece, State Performance in Higher Education, Bulletin of the 





Bureau of School Service, V (March, 1953), No. 3. 


Kentucky. 
George A. Lundberg, Social Research. 
Appendix C, p. 362. 


55. 





New York: 


P. 357. Lexington, Ky: The University of 


Longmans, Green and Co., 1929. P. 380. See 
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percentage variations from a standard 
condition or quality, in lieu of grant- 
ing so many points. If thought desira- 
ble, limits could be assigned to the max 
{mum variation to be used for each ele- 
ment, such limits having the effect of 
weights. Again, if desirable, certain 
factor weights could be assigned to the 
series of elements, to be effective in 
combining (averaging) the per cents giv- 
en to these elements when rated. 


The result would be an index num 


ber representing a weighted composite of 
judgments on detailed aspects of the ob- 
ject, and varying from 100% as normal. 


| 


It would have certain advantages over the | 


common practice in preparing and using 
score cards. If the per cents were not 
restricted, the new procedure would be 
much more flexible than tne old, allow- 
ing the rater a wider range of "scores" 
on each element, thus providing more ade- 
quately for extreme variations. These 
could later be "toned down" or amplified, 
as appropriate, by the final weights as- 
signed to each element. This plan would 
permit rating above and below normal, in 
place of always rating down from an ideal 
Standard. The use of per cent permits a 
common unit of expression for all of the 
items, instead of the variable scale pre- 
sented in the typical score card by dif- 
ferent numbers of points allowed as max- 
ima for the various items. The per 
cents, being uniform, might result in 
somewhat greater accuracy in expressing 
the rater's judgment. In both cases, 
printed suggestions as to standard condi- 
tions and as to allowances for warious 
other described conditions, are in order. 
A disadvantage of the suggested technique 


| 
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is an increased amount of calculation, 
requiring multiplying if weights are 
used, and in any case, requiring the ad- 
dition of larger numbers. It would be 
subject to the difficulties of interpre- 
tation previously discussed, in connec- 
tion with value, but in this respect it 
does not suffer more than the ordinary 
score card; the difference is that score 
cards have not been subjected to as crit- 
ical scrutiny. 

In summary, the index number 
technique affords a well developed, care- 
fully examined, and versatile procedure 
for combining weighted elements into a 
composite variable. While it owes its 
origin to the field of Economics, it is 
perfectly general in its application, 
being less restricted in a number of 
ways than is multiple regression. It has 
been used in Education to reflect varia- 
tions in the cost of supplies and build- 
ings, and to indicate different levels 
of merit. Certain points that should re- 
ceive particular attention in the use of 
index numbers are: the choice of ele- 
ments, the weighting of these elements, 
their specific definition and units of 
measurement, the sampling of the source- 
field, the form of the index number used, 
and the interpretation of the results. 
There are other applications of index 
numbers that can be made, and it would 
appear that the technique should receive 
a wider recognition and use than has 
been given it in the past. 
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THE USE OF SOUND RECORDING EQUIPMENT IN THE 
AND IMPROVEMENT OF TEACHING 
by 
A. S. Barr and C. D. Jayne 
University of Wisconsin 


One of the great handicaps to the | whether the facts recorded are those of 
scientific study of teaching is the com- greatest significance. In the third 
plexity of the activity, the numerous | place, the data recorded from observa- 
ditioning factors which need to be tion to observation vary so widely in 
pt in mind, and the extreme rapidity | content and form that it is impossibl« 

th which the action takes place. The to compare one class exercise with an- 
complexity of the activity and the rapid-/| other. Fourth, the records are almost 
ty with which it changes is such thatit | always evaluative, presenting infer- 
is physically impossible to observe ac- | ences rather than facts (the ordinary 
surately the things that happen in teach-/ record is an interpretation of what the 
ing and make a reliable evaluation of observer sees and not an objective rec- 
them. It has been apparent for some time | ord of what happens) and finally, the 
that some means of obtaining more com- records are made in terms of verbal sym- 
plete records of the teaching act was es-/| bols, and verbal descriptions, no mat- 
sential to more accurate studiesof teach- | ter how good, are highly personal and 
ing. The more complete and objective the | subjective. 
record the more significant the analysis | The development of the non- 
that can be made from it. evaluative activity check list, which 

Various data-gathering devices has in a way superceded the running ac- 
have been used at various times in at- count method, made possible a more com- 
tempts to build up adequate records of plete, objective, and comparable record. 
the classroom activities. Among the ear-| Such check lists provide a method of re- 
liest of these was a running account of cording the important happenings of the 
tne class period jotted down in rough ab-| class period without any attempt to 
breviated form as the class work proceed-/| evaluate them. Although the check list 
ed. While this was far superior to mere | doubtless represents progress in the 
memory of what had happened, it had many | difficult task of studying the teacher 
serious shortcomings. In the first place,;| at work, it too offers a very insuffi- 
many important factors can not be record | cient record of the happenings of the 
ed by this method due to the slowness of classroom. While check lists may be 
writing and to the fact that the observer | made more objective and reliable, they 
can concentrate upon only one or two as- may be quite incomplete, many details of 
pects of the lesson at a time. Second, | setting, gestural and even verbal ex- 
since the activities recorded represent pression being lost in such reports. 
but a small fraction of the total class A third method employed in re- 
activities, the record is always incom- cording the happenings of a classroom 
plete and there is a question as to is the stenographic report. This 

| 
l. A. S. Barr, An Introduction to the Scientific Study of Classroom Supervision, (New York: 
D. Appletonand Company, 1931) pp. 190-234. 
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aid 
second the stenographic record 
includes only very fragmentary and i 
complete information concerning the no 
auditory phases of the class work. T! 
movements of teacher and pupils, the uss 
of visual aids, the general class atter 
tion, blackb« & neral phys 
ironment of the classr 
personality of the teacher a athered 
from visual p etc., are pret- 
ty largely missing from a stenographic 
: Clearly if a complete record 
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recitation it would consist of a visual- 

auditory report such as one might secure 

nly from a sound motion picture. 
As one looks to the future one 


sees that the desirable record of the fu | 


ture is to be that of the sound motion 
picture. The rapid technological de- 
velopment in this field indicates that 
even now such records are entirely pos- 
sible for those that have the time, 

money and equipment to make then. Such 
records will place before the student of 
education, in permanent form the impor- 
tant facts of teaching. They can be re- 
produced as often as desired, to be stud- 
ied and analyzed one factor at a time. 
It would be hard to overemphasize the im- 
portance of such records in the scientif- 
ic study of teaching. 

As a first step in the produc- 
tion of such records of teaching, exper- 
imental work has been carried on now for 
some time at the University of Wisconsin 
in the field of sound recording. This 
experimentation appears to have reached 
a stage where satisfactory sound records 
can be made of ordinary class work. The 
equipment used in the making of such 
records and some of the problems involved 
are described in this article. 

In the first place it should prob- 
ably be pointed out that there are numer- 
ous methods of producing sound records. 
Of these many methods it seems that only 
two may be practically used under pres- 
ent conditions for classroom recording. 
These methods are the photographic sound 
on film record, and instantaneous record- 
ing on disc. Each method has its advan- 
tages and disadvantages. One of the very 
best means of recording sound is sound on 
film. It is possible by this means to 
secure recordings of very high quality. 
It is also possible to make long uninter- 
rupted recordings up to over an hour in 
length, which is a decided advantage in 
classroom recording. This system is not 





however without its disadvantages. First, 
the film must be developed before it can 
be played back. Immediate reproduction 
is impossible. Second, recording on 
film is a complicated process and too 


technical for the ordinary lay worker. 

| Third, the film records are expensive, 

| delicate, and difficult to handle. Proc- 
| esses are now being developed however 

| that may overcome certain of these diffi- 
| culties. 


At present, while instantaneous 
sound recording on disc is not without 


| certain limitations, it appears to have 


certain advantages over sound on film 
recording for ordinary classroom use. 
First, there is no intermediate process 
between the recording and the reproduc- 
tion. The record may be played back im- 
mediately. Second, the method is simple 
enough so that good results may be ob- 
tained without the services of a trained 
technician. Third, the equipment is 
less expensive than that required for 
other methods of recording. Its chief 
disadvantages are that there is a cer- 
tain amount of needle scratch which 
sound on film avoids and that the amount 
of recording which can be placed on one 
Side of a record is very limited as com- 
pared to the amount on sound film. One 
Side of a twelve-inch record at the 
standard turntable speed of 78 R.P.M. 
will make a recording of about five min- 
utes. By changing to the 33 1/3 R.P.M. 
speed the amount of recording is approx- 
imately doubled but the quality at the 
reduced speed is not so high. Whether 
true at the present time or not, it was 
felt at the time experimentation was 
started at the University of Wisconsin, 
that the instantaneous recording on disc 
method offered the greatest possibili- 
ties in the field of classroom recording. 
Equipment of this type was therefore 
purchased. The various units of this 
equipment are shown in Figure 2. 





3. Prof. M. L. Hanley, University of Wisconsin in Materials tor Research, to be issued by the Joint 
Committee on Materials for Research of the Social Science Research Council and American Council 
of Learned Societies; also in, Proceedings of the Third International Congress of Phonetic Sci- 








ence, to be published by the Cambridge University Press. 
Karl Windesheim, Practical Apparatus for Sound Recording in the Speech Classroom, Labo- 


ratory, and Clinic, (Ph. D. Thesis, University of Wisconsin, 1934) pp. 100-119. 
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Figure 2. The Units of Equipment Employed in Making Sound Records. 
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Windesheim, Ibid., pp. 119-121. 


*. M. L. Hanley, op. cit. 





mechanically inscribes the sound waves 
into a suitable surface. 

5. A record blank, of material 
suitable for receiving and retaining thé 
undulations cut by the stylus, and alsc 
suitable for actuating a reproducing 

s 


stylus. Aluminum blanks have been found 
: : 5 
juite satisfactory.° 


6. A turntable which will re- 
volve the record blank at a constant 
speed. For continuous recording equip- 
ment with two turntables offers several 
advantages. 

7. A phono pick-up which furnish 
es a means of transforming the sound 
wave forms cut upon the record back again 
to their equivalent electrical energy. 

8. Audio-frequency amplifier, 
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the same as in 


the electrical 
1e extent nece 


speaker, 
sounds. 
necessary wires 
plugs, etc., to properly interconnect 
the above equipment. 

11. A control panel for the prop- 
er manipulation of each unit of the 
equipment which will consist of a main 
volume control, a tone control, a fader 
for each microphone, a switch to connect 
the amplifier to either the pick-up or 
the cutting head, and a visual volume in- 

icator to show the power being deliv- 
ered to the cutting head. 

12. A power pack to provid 
necessary electrical energy 
the entire instrument 

3. Headphones to be used by the 
yperator in monitoring the controls. 

Figure 3 shows the wiring dia- 
gram for the Wisconsin instrument. 

In developing the equipment for 
making sound records of classroom pro- 
cedure two major factors were kept in 
mind. First, the equipment must be of 
such nature as to preserve as nearly as 
possible the naturalness of the class 
ituation when a recording is made. 
nearly the record is a typical 
teacher's work the great 
its value. econd, the equipment 
give a fait thful reproduction of al 
the audible happenings of the class. 

To meet the first requirement, 
that of securing a typical sample of the 
teacher's work, it was felt necessary to 
have portable equipment which could be 
taken into the regular classroom. It 
seemed obvious that to take a class into 
a studio would make a situation so arti- 
ficial that any record made would be far 

from the typical sample desired. The 
equipment was, therefore, so designed as 
> be readily transported as may be seen 
in Figure 1. It consists of two main 
units, the recording unit and the ampli- 


more 


] a . 
pile Ys the 





6. For a summary of advantages and disadvantages 


Hanley, op. cit. 


fier. 
the re 
microphones, headph 
ried in a compartment 
recording unit. These tw 
gether weigh about 100 pounds. 
amplifiers with the ee 
to operate them are built as 
unit easily carried about 

In line with 
ing as nearly as possible 
uation, the equipment is 
the microphones are the only 
equipment actually placed in the class 
room. The rest of the equipment and the 
Operator can be located in any cconven- 
ient room nearby. While children and 
teacher may be self-conscious at firs 

soon wears off, and it is probable 

the situation becomes more normal 
I if the supervisor were present tak- 
ing notes. If a teacher of class seems 
to find it difficult to become adjusted 
to the microphone it is possible to 
leave the microphones (or dummy micro- 
phones) in position for several days and 
the record can be taken at any time 
without either the teacher or the 
being conscious of it. This 
will not be necessary. 

The second prerequisite, a 
faithful record of what actually hap- 
pened, taken under classroom conditions, 
puts heavy demands upon the equipment. 
The microphone must be rugged enough to 
stand up under the knocks which constant 
transportation is bound to give it; it 
must be of a non-directional type so as 
to pick up equally well sounds from any 
part of the room; jt must be sensitive 
enough to pick up ordinary conversation 
in any part of the room. Experience has 
shown that a crystal microphone meets 
these requirements very satisfactorily. 

The microphone should carry a 
cable of 35 or 40 feet so that it may be 
plugged into the preamplifier outside of 
the classroom. There should be a shield- 
ed cable of about 70 to connect the 
preamplifier and the amplifier, enough 


buil 


so made 


part of 


ciass 


generally 
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feet 


of various types of sicrophones 900 Prof. M. L. 
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Wiring Diagram of Sound Recording Instrument. 
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to make certain that it will reach a con- 
venient room where the rest of the equip- 
ment can be set up. 

In recording, the recording unit 
and the amplifier are set up side by side. 
The operator, by means of earphones lis- 
tens to what is happening in the class- 
room. By listening with the earphones 
and by watching the needle of the volume 
indicator the operator can soon learn to 
operate his controls so as to get the 
best recording possible. 

Experiments in actual classroom 
situations shows that the biggest prob- 
lem involved is that of overcoming acous- 
tical distortion. When one contrasts the 
ordinary classroom with its hard floors, 
bare walls and hard finished ceiling with 
the sound recording studio with its vari- 
ous sound insulating devices to give the 


exact acoustical properties needed, itis 


not surprising that this is true. When 
one also considers that in studios, with 
the best acoustical conditions, the 
speakers are generally within a few feet 
of the microphone while in classroom re- 
cording the pupils are seated in all 
parts of the room, it is easy to under- 


stand that there are many difficulties to 


be overcome. 

One source of this acoustical 
distortion is the reverberation of the 
room or of objects in the room. Distor- 
tion is produced because the different 


frequency components in the sound may ex- 


perience unequal absorption by the room 
surfaces. 
or certain of them, may diffract or in- 
terfere in varying amounts with the dif- 
ferent frequencies thus causing distor- 
tion. The acoustic conditions may be 
such that some frequencies will be exces- 
Sively damped while others will be over- 
emphasized; there may be reverberation, 
overlapping, echoes and other extraneous 
noise. 

In addition to this type of dis- 
tortion from reverberation there is what 


is known as the "hangover" effect due to the | 
| 





In addition the room surfaces, | 


persistence of sound after its source has 
been silenced. This phenomenon has a 
tendency to blur the sound, to make the 
beginnings and endings of speech sounds 
different and to cause vowel sounds to 
mask succeeding consonants.” 

Experience has shown that other 
factors remaining constant, the greater 
the distance of the source of sound 
from the microphone the greater the dis- 
tortion. Obviously then one thing which 
can be done in classroom recording is 
to use several microphones so placed in 
tne room as to bring every child to 
within six or eight feet of one of then. 
The audio-frequency amplifier must pro- 
vide a channel and fader for each micro- 
phone used. Many of the acoustical dif- 
ficulties can in a measure be overcome 
by the proper placing of the microphones 
in relation to the sources of sound and 
the acoustical conditions of the room. 
Sound absorbing materials such as rugs, 
bulletin boards, drapes, etc., help to 
reduce acoustical distortion. Open win- 
dows will also help very materially in 
this respect. 

In concluding this discussion 
attention should be called égain to some 
of the uses of sound recording instru- 


/_ments in education.§8 


1. Sound-recording instruments 


| may be employed in making objective rec- 


ords of classroom instruction for pur- 
poses of research. 

2. Sound-recording instruments 
may be employed in the improvement of 
teachers in service. By the use of 
such an instrument teachers are enabled 
to listen to records of their own teach- 
ing and thus get a better picture of 
their teaching activities. These rec- 
ords can also be used in the analysis 
and improvement of teaching. 

3. Sound-recording instruments 
may be used in the institutional train- 
ing of teachers. Libraries of records 
may be developed to illustrate various 
types of classroom procedure. Such 





7. Alexander Wood, Sound Waves and Their Uses, (London: Blackie and Son Limited, 1930), pp. 112- 


125. 
8. For a discussion of this topic see, Karl Windesheim, op. cit., pp. 230-248. 
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their own difficulties in speech, oral 
reading, music, etc. 

6. Sound-recording instruments 
may be used to make and give standard- 
ized achievement tests. With sound 
tests recorded on disc such factors as 
test directions, time allowed for each 
item of the test, etc., can be made 
practically uniforn. 





A STUDY OF LATERALITY TEST ITEMS 


by 


Catharine J. Hull 
Speech Clinic, University of Minnesota 
inneapolis 


In the last few years, laterali- 
ty has assumed a significant role bothin 
the experimental laboratory and in clin- 
ical remedial work. It has been related 
not only to observable peripheral activ- 
ity, but its confusion has been consid- 
ered symptomatic of certain dysfunctions 
of the central nervous system. Moreover, 
recent work in aphasia has demonstrated 
that speech has a unilateral localiza- 
tion in the cerebral cortex, and that 
this localization is correlated directly 
with hand preference.2 

Since unilateral motor activi- 
ties have been widely accepted as periph- 
eral manifestations of this cerebral one- 
sidedness,* it is desirable to obtain a 
test of the side preference in these one- 
sided activities. As it is usually im- 
possible to obtain an actual performance 
of each one of these activities, the best 
solution of the problem is to secure the 
information from a questionnaire. If this 
obtained inrormation is to be used as a 
basis of research and clinical recommen- 
dations, it is imperative that it be ob- 
tained from a questionnaire which is sig- 
nificantly valid and reliable. Does the 
subject actually perform the unilateral 
activities as he indicates on the ques- 
tionnaire? Can his written answer be ac- 
cepted as a reliable one? This study was 
initiated in an attempt to answer those 
questions, and to discover which items 
might warrant inclusion in such 4 ques- 
tionnaire. 


Procedure 
Two questionnaires and two per- 
formance tests were given. The ques- 





tionnaire was composed of 40 items. Some 
of the items had been used in previous 
unstandardized questionnaires, and others 
related to ordinary one-sided activities 
were included. An attempt was made to 
include only those activities with which 
the average person was familiar, so spec- 
ulation could be reduced to a minimum, 
In the second questionnaire, the identi- 
cal items were included, but were ar- 
ranged in an entirely different order. 
Simple directions were printed at the 
top. of the list of questions, and were 
called to the attention of those taking 
the test. A minimum of four weeks was 
allowed to elapse between the adminis- 
tration of the first questionnaire and 
the first performance test. The same 
period of time was allowed before the 
next succeeding test. A copy of the 
questionnaire is given on the following 
page. 

All of the performance tests 
were given by the experimenter and two 
assistants, both of whom had been trained 
in their administration. Definite spoken 
directions were given before each act was 
performed, and the subjects were in- 
structed not to start until the instruc- 
tion was completed. The articles were 
arranged on a large laboratory table in 
positions which favored neither hand. 
Each subject proceeded around the table 
in clockwise fashion, facing the table. 
This was done because the majority of 
the subjects were right-sided, and in 
such a manner, any advantage from bodily 
position was afforded to the left side. 
An attempt was made to reduce the subjec- 
tive element to a minimum by giving only 








1. Weisenberg, Theodore, and MacBride, Katharine, "Aphasia", Commonwealth Fund, New York, 1955, 
pp. 435, 451-52. : 
2. Travis, Lee Edward, "Speech Pathology", Appleton Co., New York, 1931, p. 59. 
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Name 


Grade in School 


Age Sex 





Present Date 








This is a test to determine which side you use in manual activities. 


In the following questions, encircle the letter R if you perform the cer- 


tain activity with the right hand, L if you perform it with the left hand, and the 


letter E if you can perform it easily with either hand. 


ties, consider your hands empty when you begin to perform then. 


NIOuoe wn re 


10. 
ll. 
12. 
13. 
14. 
15. 
16. 
17. 
18. 
19. 
20. 
Ql. 
22. 
23. 
24. 
25. 
26. 
27. 
28. 
29. 


30. 
31. 
32. 
33. 


34. 
35. 
36. 
37. 
38. 
39. 
40. 


» Which 
- Which 


With which 


foot do you kick a ball? 


When you cross your legs, which one is on top? 
When hopping on one foot, on which foot do you put your weight? 


- Which hand 


Which hand 
Which hand 


holds a hammer while hammering? 
uses a can opener? 
holds the scissors (shears) while cutting? 


Which eye remains open when you sight with one eye through a small 
hole in a piece of paper? 


hand 
hand 
hand 
hand 
hand 
hand 
hand 
hand 
hand 
hand 
hand 
hand 


Which 
Which 
Which 
Which 
Which 
Which 
Which 
Which 
Which 
Which 
Which hand 
Which hand 
With which 
With which 
Which hand 
Which hand 
With which 
Which hand 
With which 
Which hand 
hand? 

With which 
Which hand 
Which hand 
Which hand 
a straight 
Which hand 
Which hand 
From which 
Which hand 
Which hand 
Which hand 
Which hand 


distributes cards when dealing them? 

holds the handkerchief when you blow your nose? 
waves goodbye? 

Spins a top? 

strikes a match? 

winds a watch? 

holds a toothbrush? 

takes money from a4 purse? 

holds the knife in sharpening a pencil? 

directs the thread through the eye of a needle? 
holds the spoon when stirring in a bowl? 

holds the comb when you comb your hair? 

turns the pages in a book? 

takes the cork from a bottle of ink? 

hand do you write? 

hand do you use an eraser on paper? 

cuts with the knife when eating? 

uses a@ salt shaker? 

hand do you bounce a rubber ball on the floor? 
is on top when you applaud? 

hand do you draw a sketch or picture? 

turns the water faucet when you hold no glass in either 


hand do you pick up a penny from the floor? 

uses an eraser on the blackboard? 

throws a ball? 

is at the top of the handle when you sweep the floor with 
broom? 

holds a tennis racquet? 

is at the top of the handle when you rake? 
shoulder do you swing a bat? 

is at the top end of the handle when you shovel? 
pushes the light switch on the wall? 

puts the key in the door keyhole? 

turns the knob in opening a door? 


maw mw ww yw 


ee ee ee es) 


Dmm yw 


Dmwmwww www 


In all of these activi- 
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the direction for action, and asking no | TABLE I 
further questions. For the second per- 
formance test, which was given no less | (TABLE SHOWING % CASES WITH SAME AN- 
than a month after the second question- | SWER IN 2 TESTS) 
naire, the articles were arranged as be- 
fore, but in a different order, cor- | Item TiT, QiTi QiQe 
responding to that of the items in the Number (50 (220 (160 
second written form. cases) cases) cases) 
The subjects used were unselect- 
ed members of the beginning speech class-| 
es at the University of Minnesota, and 
members of the English classes of the 
University High School on the same canm- 
pus. In the first consideration of the | 
results, the two groups were tabulated 
separately. However, the distributions 
on individual items were so similar that 94 2.18 93.75 
all of the subjects were considered as 68 55.45 63.12 
one group. Since the age range was not 92 73.18 76.25 
great, and as the test was not dependent | 96 94.09 93.12 
upon education or intelligence, it was | 94 83.63 86.25 
believed that such a combination was per- | 96 91.81 91.25 
missible. The entire group contained | 100 92.72 90.62 
practically an equal number of men and 86 65.45 68.12 
women. A total of 220 subjects took the 96 98.18 93.75 
first questionnaire and the first per- 86 76.36 78.75 
formance test. 160 subjects filled out 98 89.54 88.12 
both questionnaires, and 50 subjects were 92 86.36 86.25 
given the second performance test as a 68 55.45 65.00 
check on the reliability of the first ac- 88 76.36 81.25 


tual performance of the activities. 100 99.09 97.50 
98 88.63 85.61 


Results 98 95.90 96.87 
In considering the data, actual 92 79.09 78.12 
contingency correlations were inadvisa- 82 70.90 76.87 
ble because the extremely heavy weight- 84 72.72 73.75 
ing of subjects in the right-handed group 100 95.90 96.25 
skewed the distributions. Since the only 88 50.90 62.50 
information desired was whether or not 82 53.63 65.00 
the acts were performed with the same 86 59.09 68.12 
hand in the two tests which were being 90 92.27 91.25 
compared, the method of percentages was 84 68.63 78.75 
believed adequate. Table I gives the 98 9€.36 96.25 
percentages thus obtained. 80 72.72 76.87 
From the percentages, it can be 88 89.09 91.25 
seen that certain items are consistently 74 79.54 84.37 
high in all three categories, while others 68 37.27 70.62 
rank consistently low. With the excep- 92 85.90 84.37 
tion of batting, the bimanual activities 74 47.72 66.87 
of sweeping, batting, raking, and shovel- 
ing, in which the supposed lead hand has Legend 
been considered symptomatic of sidedness, | T, - 1st performance. Q, - lst questionnaire. 
showed a low relationship between the Tg - 2nd performance. Qz2 - 2nd questionnaire. 














90 92.72 86.25 
72 47.72 61.87 
88 50.45 70.00 
94 95.90 97.50 
98 80.90 95.00 
98 97.27 94.37 
88 57.27 62.50 
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written answer and the 

In batting, 89.09% | 

act as they 
for sweep- 


marison of the 
performance. 
subjects performed the 

The percentage 


actual 
a 
if the 


had indicated. 


ing was 68.63, for shoveling 79.54, and 
r raking 72.72. It seems, therefore, Be 
that the questionnaire answer is not a 


criterion of the performance of 
activities. 


reliable 
these latter three 
Limitations of Study 

As in any random sampling of stu- 
jents, there were relatively few left- 3. 
sided and ambidextrous subjects included 
in the study. a more uniform number 
of students could be obtained in each 
handedness group, a skewed distribution 
would be avoided, and correlations of 
contingency could be used for 4 more re- 
fined measure of relationship between the 
tests. Even with such a sampling, how- 4. 
ever, a correction would have to be made, 


— 
lf 


as the correlation would include only a 
three-way table. 
he objective method of adminis- 


tering the performance test may have low- 
ered the reliability by the acceptance 
of the side first choosing to perform the 
act, without the consideration of the 
possibility that the other side might also 
be able to carry out the activity. It is 
believed, however, that this influence 
was small, as students who could use 
either hand with equal efficiency admit- 
ted that fact without solicitation from 
the examiner in their attempts to carry 
out the directions. The information de- 
Sired was not whether or not either side 
was able to perform the act, but wnich 
Side did perform that act easily in the | 
majority of cases. 6 
Some one-sided acts quite fre- | 7. 
quently performed by the average individ-| 
ual may have been omitted from theinitial! 8. 
list, but it is believed that tne 40 9. 
items provided an adequate sampling for 


the testing. | 10. 

Summary ll. 

1. A performance test of 40 items given 12. 
twice to 50 students yielded 21 items 
in which the students performed the 
activity with the same side in over 
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90% of the cases. This test-retest 
indicated a high enough reliability 
of performance to warrant acceptance 
of the tests as validation of the 
same items on the questionnaire. 

On duplicate questionnaires adminis- 
tered to 160 students, 14 of the re- 
liable performance items were an- 
swered the same over 90% of the time. 
This showed a high reliability of 
those items on the written question- 
naire. 

In comparing the answers on the first 
questionnaire and the first perform- 
ance test, 14 items were answered 
identically over 90% of the time. 
This proved that those items were 
valid ones for use in a question- 
naire, using the actual performance 
as a validating measure. 

Of the items which were proved sig- 
nificantly reliable on test-retest of 
performance and questionnaire, and which 
were indicated as valid by compari- 
son of the written answer and the 
actual performance, 12 items were an- 
swered identically in over 90% of 
the cases in all three categories. 
This high degree of relationship per- 
mits use of the following items in a 
sidedness questionnaire. 


Which hand holds a hammer while ham- 
mering? 

Which hand holds the scissors while 
cutting? 

Which hand distributes cards while 
dealing them? 
Which hand spins 
Which hand winds 


a top? 

a watch? 

a toothbrush? 
Which hand holds the knife in sharp- 
ening a pencil? 

With which hand do you write? 

Which hand cuts with the knife when 
eating? 

With which hand do you draw a sketch 
or picture? 

Which hand throws a ball? 

Which hand holds a tennis racquet? 








TEACHING AND EDUCATIONAL INVENTIONS 


by 


Ibert Mellan 
Philadelphia, Pennsylvania 


Among the most recent of educa- 
tional problems, and perhaps the most 
widely discussed, is that classroom pro- 
‘edure known as instruction by mechani- 
cal devices. Since the World War active 
steps have been taken to equip hundreds 

f classrooms with appropriate apparatus 
to demonstrate the part vision plays in 
the pursuance of efficient teaching 
methods. 

It is interesting to find that 
the attention given to visual instruc- 
tion goes farther back in educational 
history than would be otherwise supposed 
by this only recent attention to its pos- 
sibilities. 

The writer has been fortunate in 
uncovering a vast wealth of material hid- 


den and filed away in the United States 
Patent Office which may throw much light 


in the way of educational study. The 
following lists of patents are, for the 
most part, concerned with teaching and 
educational devices, appliances, appara- 
tus, etc. The earliest record in the 
United States Patent Office is the con- 
tribution of H. Chard, dated February 16, 
1809, who gives us a "Mode of Teaching to 
Read." §. Randall, dated October 1, 1810, 
and January ll, 1812, offered a patent 
entitled "Mode of Teaching to Write." 

Although not of the earliest con 
tributors to this patent literature, the 
distinguished name of Dr. Maria Montes- 
sori will be found among them. Her pat- 
ent, No. 1103369 (1914), offered an edu- 
cational device for properly training 
the sense of touch, which, in her opin- 
ion, is an essential factor in the prop- 
er development of the child. 

Curiously, following these first 
two pioneers, patent literature on edu- 
cation and teaching was noticeably infre- 
quent until 1870, after which the more 





inven- 
interest 


vigorous output of educational 
tions seemed to mark a greater 
in teaching and education generally. 
Each succeeding year saw a continual in 
crease in the output so that up to the 
present time there are between 600 and 
700 inventions issued on the subjects of 
teaching and education. 

To keep pace with the never end- 


| ing variety of scientific progress, ed- 


ucators must take advantage of every val- 
uable contribution of modern science 

and industrial and educational inven- 
tion. I, therefore, believe that an in- 
telligent study of these patents is nec- 
essary for all schoolmen. This is a 
unique opportunity for educators, with 
their experience and years of training, 
to develop an abundance of adequate 
teaching material from this vast amount 
of literature which contributes to every 
type of instruction and to every subject 
in the curriculum. It may be that many 
of these patents are useless today be- 
cause they may have been outmoded in the 
face of constant advancement in educa- 
tion and teaching methods; however, here 
is the premise of the educator who must 
decide between the useful and the unfit 
material. 

Copies of these patent specifi- 
cations may be obtained from the Commis- 
Sioner of Patents, Washington, D.C. The 
cost is ten cents a patent. In order- 
ing give patent number and title of the 
invention. 

With the aid of the following 
key and the patent number an approximate 
time when the patent was issued can be 
determined. 
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Patent Number Date When Issued 
1 1790 
32000 1820 
60000 1840 
110000 1860 
230000 1880 
443000 1890 
532000 1895 
667000 1900 
817000 1905 
980000 1910 
1138000 1915 
1364000 1920 
1568000 1925 
1737000 1930 
1892000 1932 


PATENT RELATING TO THE TEACHING OF ARITHMETIC 


Title of Invention 





Teaching Arithmetic 
n f 


Means of Teaching Fractions 
Device for Teaching the Metric System 


Apparatus for Teaching Arithmetic 
Device for Teaching Involution and Evolution 


Device for Teaching Arithmetic 
f n fn nf 


Apparatus for " 
" 


Device . 8 


Apparatus " * ° 


Educational Appliance 


Means of Teaching Fractions 
Device for Teaching Fractions 


Device for Teaching Fractions and Percentage 
Apparatus to Fecilitate the Teaching of Notations and Numerations 


Device to Aid in Teaching Arithmetic 
Apparatus for Teaching Arithmetic 


Device for Teaching Numbers to Children 
. ° ° Arithmetic 


Device for Teaching Combinations of Numbers 
Teaching Arithmetical Calculations 


Apparatus for Teaching Arithmetic, etc. 
n n fn f fn 


Patent Number 





4632 (1846) 
149235 


151971 
176735 


196583 
209385 


214822 
215916 


234247 
262191 


264572 
296018 


342651 
356167 


383300 
384959 


389415 
390824 


416593 
431102 


452302 
462376 


502184 
588371 
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Title of Invention 





Device for Teaching Computations 
Education Device for Teaching Sperica 


Device for Teaching the Fundamental Operation with Numbers 
Device of Facilitating Teaching of Fraction 
Apparatus of Teaching and Learning Arithmetic 


Apparatus of Teaching and Learning Arithmetic 
Device for Teaching Fractional Value 


Device for Teaching Fractional 
® . * Numbers 


Device for Teaching Arithmetic 
Appliance for Teaching Arithmetic 


Device for Teaching Arithmetic 
Appliance for Teaching Arithmetic 


Appliance for Teaching Arithmetic 
Device Used in Teaching Geometry and Trigonometry 


Device for Teaching Numbers in Combination, Analysis Factors and 
Multiple 
Appliance for Teaching Arithmetic 


Device for Teaching Arithmetic 
Apparatus for Teaching Arithmetic 


Device for Teaching Division 
380532; 704979; 708568; 1248238; 666999 
CHEMISTRY 
Apparatus for Teaching Chemistry 
Appliance for Teaching Chemistry 


DRAWING 


Device for Teaching Drawing 
Cards for Teaching Drawing 


Device for Teaching Drawing 
Apparatus for Teaching Drawing 


Educational Art Text Sheet 
Charts for Teaching the Reading of Drawings 


Device for Teaching Drawing 


GEOGRAPHY 


Apparatus for Teaching Geography and Astrography 
Teaching Geography 


Educational Device for the Illustration of Longitude and Time 
Educational Globe 


Patent Number 





604953 
629891 


812408 
816204 


841158 
846484 


856068 
1043652 


1098330 
1151279 


1129890 
1174689 


1211625 
1218931 


1405010 
1541179 
1594396 
1662503 


1728584 
1730418 


1818566 


242821 
480275 


171268 
282659 


471442 
651791 


720187 
1049241 


1617207 


2426 (1842) 
143934 


526629 
418455 








P 
<9% 
4 ] YT 2 
Title of Invention 
paratus for Teaching Astronomy and Geography 
| | pp . a 
. " . Elements of Geography 
g Geography 
Tay 
ISTORY 
¥ "ee - ’ “ s «+ ~ 
ADI ratu for Teach ng i Story 
’ ‘ . 'T ‘ y » TT { © 4 
hart for Teaching Universal tor} 
— 
MUSIC 
© ) T. A 'y « bh 4 oO ‘ 4 
nal Table for Teaching Music 
+4 ¢ T ~« Lin c4 _— 
A . € yr ueacning pinging 


T . » ¢ ‘ sed, 7 
Device for Teaching Musical 
fn 


Transposition 
fn 


ah 4 1 +4 
vey € r eacning Music 
fn , . fn 


far T } 
LOr ieac 


1ing Music 
Teaching Music 


He 


evi 
. ins » + 
ndicator for 


Device for Teaching Music 


yn f © , 4 Maiiel 
for Teaching Music 


Device for Teaching Music 
n " ® Vocal Music 


Apparatus for Teaching Melodies 
4 
i 


Device for Teaching Time in Music 
Device Teaching Musical Notation 


fan 
AUL 
Game-Cards for Teaching Music 


Device for Teaching Music 
f fn fn a 


Device for Teaching Music and Singing 
Apparatus for Teaching Music 
Apparatus for Teaching Music 

Device ® = ® 

Device for Teaching the Playing of Stringed Instruments 
Device for Teaching Music 

Device for Teaching Music 

Chart " " . 


Apparatus for Teaching Music 
® ® ® Sight-Singing 


Piano Teaching Mechanism 
Apparatus for Teaching Piano 
699510; 695447; 692736; 672678; 675723 
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Patent Number 


501136 
646582 


1042455 


198749 
1406173 


176471 
181312 


183103 
197497 


203210 
264932 


367156 
417734 


435131 
451010 


510302 
528310 


577667 
585681 


618611 
621323 


632137 
657953 


736960 
752836 


762990 
788063 


830915 
964593 


979193 
1058831 


1600052 
1685682 


1732377 
539191 








Title of Invention 


PENMANSHIP-WRITING 


Yode of Teaching to Write (S. Randall) Filed Jan. il, 
" . ” " " " ° Filed Oct. l, 


lummets of Lead in Teaching Writing (T. Weston) Filed June 
Diagram for Teaching Penmanship 


Sopy-slips for Teaching Penmanship 
nies n n fn 


yages for Teaching Penmanship 
Device " ” . 188984 
Device for Teaching Penmanship 217499 
iland-guide for Use in Teaching Penmanship 226942 


Appliance for Teaching Penmanship 364249 
n n n 


n Aan 
414300 


for Teaching Penmanship 503796 
” " n 507950 


Teaching Penmanship 510372 
" " 726898 


Teaching Penmanship 735782 
n n 757383 
oo wv “ 


Teaching Penmanship 764970 
. ” 783496 


"YQ01K0 
(JILVE 


Device Teaching Penmanship 
" 809712 


n fn 


Device Teaching Penmanship 862004 
n " 940744 


for Teaching Penmanship 972273 
” = . 1136450 


1184155 


Device for Teaching Penmanship 
592029 


Apparatus for Teaching Penmanship 


PHYSICAL 


Apparatus for Teaching the Art of Swimming 149249 
. . e Swimming 206892 


Apparatus for Teaching the Art of Boxing 426978 
. e . Diaphragmatic Breathing 537516 


Apparatus for Teaching Swimming 563578 
e e e Children to Walk J 58464 


946886 


Apparatus for Teaching Swimming 
748532 


Teaching the Golf-swing 


1703403 


Mechanical Figure for Teaching Golf 
1815443 


Self-Teaching Walking and Dancing 
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Invention Patent Number 


READING 


hard) Filed Feb. 16, 1809 
4 


y 
is 


ects 569846 
660255 


° 
r #4 Ford ) 
iadentilying VU 

neading 


and Practicing of Reading 822937 
1010782 





1224742 
1263626 





Teaching Reading 1278425 
hoa Reading and the Like 158462 


1514270 
1706550 


SPELLING 
, -, 52758 


a ‘ Ne 2 _ , » 14 
Apparatus for Teaching ppeiiing 


: ors . , 189535 


r for Teaching Word-Analysis 


Appar atus for 218306 
Kindergarten Apparatus of Teaching Spelling 542737 


Kindergarten Apparatus of Teaching Spelling 366821 
" Game " e . 386845 


Kindergarten Apparatus of Teaching Spelling 371815 
Apparatus for Teaching Spelling 542076 


505807 


Teaching Spelling 
" n 792801 


for Kindergarten 


Means for Teaching Spelling and the Like 1099324 
" " " the Alphabet 1270566 


Means for Aiding in Spelling and Phonics 1326695 


TYPEWRITING AND BOOKKEEPING 
Teaching Business Practice 534723 
Machine for Teaching Touch Typewriting 823362 


844025 


Device for Teaching Touch Typewriting 
+ 4 1008591 


. Typewriting 
1027514 
1415278 


Means for Teaching Shorthand 
Apparatus for Teaching Bookkeeping 


694944; 898114; 665991; 678618 


TELEGRAPHY 


522454 


Machine for Teaching Telegraphy 
736936 


Teaching and Practice of Telegraphy 














March, 1936 


Teaching Reading and Sending 
Instrument for Teaching and P 


Apparatus for Teaching Telegraphy 
Teaching Wireless Telegraphy 


Teaching Wireless Telegraphy 
” Telegraphic Codes 


Educational 
f 


Educational 
fn 


Educational 
n 


Educational 
n 


Educational 
fn 
Educational 
n 
Educational 
fn 
Educational 
n 


Educational 
n 


Educational 
fn 


Educational 
Educational 
Educational 
Educational 
Educational 


Educational 
fn 


Educational 
nf 


Educational 
n 





Title 


Block 
fn 
Block 
n 
Block 
fn 
Block 
f 


Block 
Ball 


Construction Set 


Folder 


Frame 
Concentrator 


Game 
fn 
Game 
n 


Implement 
Model 


Puzzle 
fn 


elegraphic 
icing Tele 


Ibert Mellan 


Invention 


Des. 64967 June 


1354910 


1377261 


1Anne 


1477255 
1527051] 
1864702 


1571488 
1599568 


710652 
778928 


874152 
1587928 


1CR20£CA 
1163254 


1670254 


ry anor 
f 1852 
nr 


770841 


ae 4 oc 
1533 fk 


1551895 
581178 
1767424 
966473 
639941 


364465 
740451 
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Title of Invention 


‘ N 
it na N € 
” : ‘ 
Mate A 4 
Ue 4 
lucat 14 Ma lal 
* nf 
+4 a1 Ws ; 
i nal Material 
. ¢ r 


W *K 4 21a 
ational System 

. Top 

ational neet 

" n ta 


Educational Test Device 


cational Device for Object Teaching 
” - " Teaching Spelling 


Educational Device for Interchangeable Topics 
. Demonstrating Device 


Figure for Educational Purposes 
, f n n 


Apparatus for Teaching Gun Practice 
fn n fn n fn 


f = arrare a rT’ . n4 » & rc +4 a 
Apparatus ior ieacning wnooting 
fn 


Means of Teaching Sewing 
Teaching Embroidery 


Tesehing EFmhroiad 
sta llTig BM@DPrToiacel 


y 
Means for Teaching Parliamentary Law 

Means of Teaching Facial Expressions which Occur in Speaking 
Teaching the Deaf to Hear and the Mute to Speak 

Means of Teaching Aviation and Testing Aeroplanes 

Apparatus for Teaching the Art of Aeroplaning 


Apparatus for Teaching in Kindergarten 
a . " Projection 


Implement for Teaching Fingering 
Magnetic Indicator for Teaching 


Appliance for Teaching Botany 
Device for Teaching English 


Time Teaching Apparatus 
Teaching the Reading of Clocks and Dials and the Distinguishing of 
Color 


Chart for Teaching Physiology 
Puzzle-Card for Object Teaching 


“Ui 4 


Patent Number 





971865 
736448 


853042 
1656030 


1868823 
1535188 


305585 
1627211 


656569 
1491123 


1586628 
1490934 


1843183 
1867511 


1343721 
1619160 


1729241 
1802331 


630217 
1841369 


7352596 
751591 


944368 
707729 


537609 
778162 


1532110 
946868 


723716 
1676716 


1007467 
1018645 


653838 
1327474 


1666464 
126761 


496256 
1539397 


825902 
1143519 


298746 
274799 
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Title of Invention Patent Number 





Joject-Teaching Frame 248659 
e * 
chart for Object-Teaching 197279 


Educator 523338 
" 1536180 


Educator 1561744 
. 1587864 


Educator 1867876 
Educational Article 1502006 


Educational Amusement Device 771394 
n " n 973186 


Educational Amusement Device 1075248 
n ° ® 1120681 


Educational Amusement Device 1386248 


THE FOLLOWING PATENTS ARE INDIVIDUALLY TITLED "EDUCATIONAL 
APPARATUS"; ALTHOUGH THEY MAY REFER TO DIFFERENT 
SUBJECTS OF THE CURRICULUM 


1370826 1433852 1634289 1810745 
1384801 1476671 1636234 1807615 
1392258 1516097 1662272 1831383 
1394620 1523188 1700946 1839558 
1417434 1531070 1705315 1860895 
1428456 1617272 1766355 1862872 


THE FOLLOWING PATENTS ARE INDIVIDUALLY TITLED "EDUCATIONAL 
APPLIANCES"; ALTHOUGH THEY MAY REFER TO DIFFERENT 
SUBJECTS OF THE CURRICULUM 


281770 528010 627365 791743 1018146 
309064 530450 629046 791709 1028212 
375095 511470 636182 802807 1050327 
385046 574815 641151 811169 1054890 
588486 577196 641738 814653 1084370 
401043 536497 641739 819847 1093690 
421044 547217 645440 831154 1099372 
466296 553533 646661 832331 1158774 
445782 555026 667397 853756 1163125 
446468 556431 669878 864090 1161685 
448019 563872 672062 873667 1129019 
463475 564396 674482 882463 1138807 
468475 560964 690446 684863 1141340 
472419 572972 690664 885273 1141409 
474773 589180 692019 898754 1150550 
476468 595455 691113 952652 1142651 
480164 600234 706463 951606 1168949 
484431 602222 713085 958987 1182636 
487695 610277 715224 964064 1183068 
496993 617883 723425 967591 1183570 
500824 618198 730859 971185 1189233 
501675 622551 746305 977793 1169510 
505667 623558 747711 997986 1204089 
521360 626423 777268 1013856 1204854 
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1206795 
1213839 
1228391 
1229142 
1240031 
1243957 
1244000 
1253908 
1256224 
1256997 
1257655 
1262269 
1264449 
1260601 
1269713 
1279504 
1269784 


196532 
288628 
302194 
335838 
383389 
445759 
544714 
618114 
624614 
625881 
641283 
641683 
649054 
663287 
676313 
677652 
680695 
683267 
683171 
687288 
687570 
688388 
696690 
701997 
711486 
711879 
720510 
731175 
755837 
763926 
774998 
784145 
793676 
793767 
794005 
795855 
801316 
832871 
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1270668 

275955 
1286158 
1286232 
1286631 
1289743 
1289849 
1281295 
1291045 
1294126 
1295404 
1305742 
1310997 
1315478 
1322261 
1329850 
1329896 


838840 
871934 
886172 
889515 
892715 
894043 
925716 
927499 
935515 
940093 
947064 
969309 
969429 
1023586 
1028378 
1038332 
1041059 
1043596 
1052460 
1053598 
1071358 
1077515 
1085405 
1087186 
1100362 
1103369 
1113237 
1132409 
1136663 
1139256 
1142947 
1148616 
1163184 
1186267 
1184326 
1170537 
1196099 
1209612 


1335126 
1343095 
1364778 
1368192 
1369640 
1375308 
1491986 
1435660 
1456395 
1465699 
1470845 
1471437 
1479147 
1480458 
1483916 
1498121 
1502991 


1218993 
1228197 
1232021 
1233544 
1240556 
1245475 
1305449 
1321292 
1332761 
1346929 
1349930 
1350237 
1354692 
1356929 
1359115 
1383097 
1385096 
1394305 
1396379 
1400887 
1378874 
1405063 
1405193 
1414467 
1417828 
1419882 
1428206 
1426997 
1437037 
1445819 
1455522 
1446941 
1457223 
1457468 
1469919 
1477322 
1479423 
1479876 


1511124 
1535056 
1538929 
1538930 
1539194 
1543067 
1582810 
1587026 
1603201 
1607329 
1624450 
1630939 
1664842 
1673166 
1679536 
1689422 
1694405 


1484883 
1485190 
1486690 
1494872 
1497150 
1506210 
1509889 
1519426 
1521491 
1523047 
1530418 
1532437 
1539909 
1535706 
1541795 
1542031 
1549673 
1559665 
1560994 
1561447 
1562518 
1573358 
1578665 
1583061 
1586960 
1587685 
1581390 
1597177 
1598499 
1595115 
1597562 
1605697 
1614390 
1617831 
1617659 
1681560 
1621262 
1634713 


THE FOLLOWING PATENTS ARE INDIVIDUALLY TITLED "EDUCATIONAL 
APPLIANCES"; ALTHOUGH THEY MAY REFER TO DIFFERENT 
SUBJECTS OF THE CURRICULUM 


1696237 
1699289 
1704297 
1742781 
1751106 
1764446 
1677521 
1767774 
1775603 
1784952 
1801724 
1802492 
1811105 
1842165 
1857009 
1869522 
1874279 


THE FOLLOWING PATENTS ARE INDIVIDUALLY TITLED "EDUCATIONAL 
DEVICES"; ALTHOUGH THEY MAY REFER TO DIFFERENT 
SUBJECTS OF THE CURRICULUM 


1641773 
1645008 
1647276 
1664390 
1664808 
1674553 
1678621 
1696988 
1729518 
1732815 
1732980 
1734115 
1735456 
1734544 
1745674 
1750799 
1756208 
1762864 
1781047 
1791982 
1798647 
1804813 
1812110 
1820209 
1833793 
1837194 
1838942 
1840507 
1847815 
1854999 
1855823 
1856650 
1860483 
1864022 
1866133 
1882575 
1884476 
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CALCULATION OF CHRONOLOGICAL AGES 
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Frances Swineford 


University of Chicago 


There was published recently a & d = difference in months between 
papert presenting a technique for build- | month of given date and birth 
ing tables which would simplify the cal- | month, 

‘ulation of chronological ages as of a 
given date. According to the method |} then chronological age is given approxi- 
iescribed, it would be necessary to build | mately by the formula 

a new table for each date for which the 

ages are to be calculated. In the case ze = ; - B)years + d months (1) 
wnere the ages are desired accurate to 

the nearest month, this becomes a rather d, of course, is negative if the true 
lengthy process, and the present method age is less than G - B. 

is suggested in its stead. In order to eliminate all nega- 

A single table has been con- tive values, and thus simplify the work, 
structed which can be applied for any formula (1) can be rewritten in the form: 
given date, with certain corrections to 
be described below. If we let Age = [(G - 1) - B] years + T (2) 


= year of given date, where T = d months + 1 year. The values 
= year of birth, of T are given in Table I. 
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VALUES OF T 
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- Buros, Oscar K. "A Simple Technique for the Calculation of Chronological Ages", Journal of Ed- 
ucational Research, XXVI (Jenuary, 1933), pp. 360-363. 
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I rest month i li ). Since ¢ h month 
ass umed to mntain 30 days, a correc 
“ . ae s 
n must be made f a difference of 
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re than fifteen days between given date 
and birth date. The rrected formula 
LiOwss$ 
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_ 
| pose the birth dates of four individual: 
to be (a) March 25, 1919, (b) July 12, 
1917, (c) December 2, 1922, and (d) No- 
vember 30, 1914. It is desired to find 

their ages on August 8, 1932. 


(1) Determine G - 1 and subtract 


from it each value of B in turn. Here, 
G-1= 193 (The results of steps (1) 
to (4) are summarized in Table II.) 


(2) Find the values of T in the 
appropriate column and rows of Table I. 
In the example, all the values are from 


column 8 (Aug.) since August is the 
month of the given date. T for indi- 
vidual (a) is 1-5, and is read, "one 
five months." 

(3) Note the days of the month 
for which k is different from zero. Here, 
k = -l for every birth date > 24. For 
all other birth dates k = 0. 

(4) Add the value of k in 
months to the sum of the results of 
steps (1) and (2). 


year, 


TABLE II 


JF THI 
OF FOUR 


SUMMARY STEPS 
AGES 


IN CALCULATING THE CHRONOLOGICAL 
INDIVIDUALS AS OF August 8, 1932 
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Steps 





Na+ 
vate 


Birth 


(2) 





March 25, 1919 
July 12, 1917 
December 2, 1922 
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November 30, 1914 











